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IHPROVED ADENOVIRUS AKD METHODS OF USE THEREOF 

This invention was supported by the National 
Institute of Health Grant No. P30 DK 47757. The United 
5 States government has rights in this invention. 

ri?ld 9f th^ Invgntion 

The present invention relates to the field of 
vectors useful in somatic gene therapy and the production 
10 thereof. 

Background of the Invention 

Hunan gene therapy is an approach to treating human 
disease that is based on the modification of gene 

15 expression in cells of the patient. It has become 
apparent over the last decade that the single most 
outstanding barrier to the success of gene therapy as a 
strategy for treating inherited diseases, cancer, and 
other genetic dysfunctions is the development of useful 

20 gene transfer vehicles. Eukaryotic viruses have been 

employed as vehicles for somatic gene therapy. Among the 
viral vectors that have been cited f recpiently in gene 
therapy research are adenoviruses. 

Adenoviruses are eukaryotic DNA viruses that can be 

25 modified to efficiently deliver a therapeutic or reporter 
transgene to a variety of cell types. Recombinant 
adenoviruses types 2 and 5 (Ad2 and Ad5, respectively) , 
which cause respiratory disease in hxmans, are currently 
being developed for gene therapy. Both Ad2 and Ad5 

30 belong to a subclass of adenovirus that are not 
associated with human malignancies. Recombinant 
adenoviruses are capable of providing extremely high 
levels of transgene delivery to virtually all cell types, 
regardless of the mitotic state. High titers (10^^ 

35 plaque forming units/ml) of recombinant virus can be 

easily generated in 293 cells (the adenovirus equivalent 
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to r.«ovirus p.-Jc.,in, cell lln..) ^^^^ 
«tena«l period, without .ppr«:lrtl. lo.."- *^ 

efficacy of thl. .y.t- in ^."v.'*"' » *!"'!::^i:„„ 
transgen. 1« Wvo that compl«.«.t. . g«»tlc i»b.l«w. 

. U« b.«. d«»n.tr.ted In «.1»- ^^^^^^^ 

(1986); K. Tanzaw. et al, z m IittUK . 114(1) 
(1980) t 3.I.. Golasten et al. WIHl , J. H*A x'^ 

10 iSsat.. a2=883-89, ,1993,,- and S. I*ib.-.i .t .1, J. 
ClUuInSaSt.. 22! 1885-1893 (1994)1. II>4e«>. • 
f .: lIL::t pllcatlo, defectlv. ^"-"^ ' 

CDHA for th. cyatlo flbro.1. t^.^'to h^«. cr 

(CFTB) ha. been approved for o» in at leart "° 

cllnleal trial. I..., -9: 

692 (Oct. 21, 1993,,. Further .upport of the »fety of 
re«»binant ad«»vlrus.s for 9«.. therapy I. the ^ 
Extensive exp«:l««. cf live adenovlm. vaccine. In hu«n 

populations. ^ 

Hu«an adenoviruse. are co.,prK»d of a 
approxlBately 36 tt. double-stranded DH» "^^'^^J' 
divided into 100 map unit, (..u.), each of ^ 
hp m length. The BH. contain, -.crt ^'"^^^^^^.^ 
rep«it. (ITR) at each end of th. genome that 
for viral DH» replication. The ge". product. «. 
org«.ized into early (El through E4) and 
L5) region., baaed on expression before or »^ 
initiation of viral OHA .ynthe.!. [.ee, e.g. , Borwita, 
i"" a » Tields, Raven Pre.., Ltd. , 

jjij2i2gi, 2d edit., ed. B. B. rieio , 

" f"::-l«.eration reconhinant, replicaticn- 

def icu:t Ueno^iruses which have been ^-loped f« ,«.e 
^„apy contain deletion, of - " 



20 



25 



35 



^r^b re;ro^.^i. ^Plication-defective viru. i. 
^o.^ on7n adenoviru-tran-fomed, conpleaentation hu«n 
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embryonic kidney cell line containing a functional 
adenovirus Ela gene which provides a transacting Ela 
protein, the 293 cell [ATCC CRL15731. El-deleted viruses 
are capable of replicating and producing infectious virus 
5 in the 293 cells, which provide Ela and Elb region gene 
products in trans. The resulting virus is capable of 
infecting meuiy cell types and can express the introduced 
gene (providing it carries its o%m promoter) , but cannot 
replicate in a cell that does not carry the El region DHA 

10 unless the cell is infected at a very high multiplicity 
of infection. 

However, in vivo studies revealed transgene 
expression in these El deleted vectors was transient and 
invariably associated with the development of severe 

15 inflammation at the site of vector targeting [S* 

Ishibashi et al, J. Clin. Invest ., £1:1865-1893 (1994); 
J. M. Wilson et al, Proe, Matl . Acad. Sei.. USA. ££:4421- 
4424 (1988); J. M. Wilson et al, Clin. Bio.. 1:21-26 
(1991) ; M. Grossman et al, Som. Cel l, and Mol- Gen.. 

20 12:601-607 (1991)]. One explanation that has been 

proposed to explain this finding is that first generation 
recombinant adenoviruses, despite the deletion of El 
gfenes, express low levels of other viral proteins. This 
could be due to basal expression from the unstimulated 

25 viral promoters or transact ivat ion by cellular factors. 
Expression of viral proteins leads to cellulau: immune 
responses to the genetically modified cells, resulting in 
their destruction and replacement with nontransgene 
containing cells. 

30 There yet remains a need in the art for the 

development of additional adenovirus vector constructs 
for gene therapy. 
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. novel rec<»ia>ln«.t «l«««lru. prodoetlon .y.t». On. 
cogent I. . «.«ttX. PU.»W, PM*. "^^.^'Z^ 
.d^ovlru. clB-.l»ant. n.c..ary for replioation and 

vector c«ri« . «l«=t«l tr.n.gene 
control of . ..l.ot.d pro«.t«: «.d oth«: convention.1 
v«tor/pl.«id reguLtory eo«>on.nt.. Th. oth« 
, eonpon.nt i. . h.lp.r .denoviru., «hlc. .Ion. or«i^. 
p.c^.,ln, cell lln., suppli.. .ufflci«.t ,.». ..,»««.. 
„.cs.«y for a productiv. vlr.1 i"""";- " [^^^^ 
preferr.4 «*odi«nt, th. h.lp« vlru. h.. b«m .lt.r«I 
L contain «dlf i«ti«- to th. natlv. 
5 Which dlr«t effici«rt padcglng, so » to 

dU^l. or "crlppl." th. pacxagln, function of th. h.lP« 
vlru. or it. ability to repllcat.. 

I„ «,oth« .sp.ct. th. pr.B.nt in,«.tlon Provld- • 
„co^in«,t .d.noviru.. an ^ "^'^^ ^ 
... of th. =o.pcn«.t. abova. "^"^ l'^^ 

coBpri... an .denoviru. capaid, ad«»vlru. 
necessary for replication ^ virion «>«P'l***^' 
i. deleted of .11 viral g«." (l-.- ^i""! ^ ^ 

ieadti^ fra».,. Thl. viru. partial, carrl.. . 
„ tran.,L und„ the control of a selected Pr-ter^nd 
other conventional vector regulatory 
MA recoi^tlnant vlru. 1= ch«:.cterl«a by "i^h tlt« 
:^an.g«.. delivery to a host cell and the ability to 
stably integrate the transgen. into the hort C.U 
,0 chroiso... xn one e-»dlB.nt, th. viru. crrl.. » it. 
transgen. a reports: g«». another «*odi-.nt of th. 
recombinant vlru. contain. . th«rap.utlc 

in «.oth.r a«»ct. th. lnv«.tlon provxd.. a m.thod 
for producing th. .bove-d.scrlb.d reco^lnant Adi viru, 
„ by c^transfectln, a cell lln. (either a paCcagln, cell 
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line or a non-packaging cell line) with a shuttle vector 
or plasmid and a helper adenovirus as described above, 
wherein the transf acted cell (generates the AdA virus. 
The AdA virus is subsequently isolated and purified 
5 therefrom* 

In yet a further aspect, the invention provides a 
method for delivering a selected gene to a host cell for 
expression in that cell by administering an effective 
amount of a recombinant AdA virus containing a 
10 therapeutic transgene to a patient to treat or correct a 
genetically associated disorder or disease. 

Other aspects and advantages of the present 
invention are described further in the following detailed 
description of the preferred embodiments thereof* 

15 

Brief Description of the Flcmrefl 

Fig. lA is a schematic representation of the 

organization of the major functional elements that define 

the 5' terminus from Ad5 including an inverted terminal 
20 repeat (ITR) and a packaging/ enhancer domain. The TATA 

box of the £1 promoter (black box) and ElA 

transcriptional start site (arrow) are also shown. 
Fig. IB is an expanded schematic of the 

packaging/ enhancer region of Fig. lA^ indicating the five 
25 packaging (PAC) domains (A-repeats) , I through V. The 

arrows indicate the location of PGR primers referenced in 

Figs. 9A and 9B below. 

Fig. 2 A is a schematic of shuttle vector 

pAdA.CNVLacZ containing 5' ITR from Ad5, followed by a 
30 CHV promoter /enhancer, a LacZ gene, a 3* ITR from Ad5, 

and remaining plasmid sequence from plasmid pSP72 

backbone. Restriction endonuclease enzymes are 

represented by conventional designations in the plasmid 

cons^bructs. 
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Fig. 2B is a schematic of the shuttle vector 
digested with EcoRI to release the modified AdA genome 
from the pSP72 plasmid baclcbone. « 

Fi,. 2C i. . .ch«»tic dH.l=tion of th. function of 
the vector i^. « the pre-«.c of « 
h.lp«: viru. M.CBhpM Which «»ode. . reporter .lnx9«« 
for huBMi plecente .llcallne phoephetaee (hp»P) , the 
Mi.CH»L.cZ I. pe=Kag«. Into P''"";'^^;;^'^^"^. 

cpeid., dUtingulshebl. fro. the heXp«: virion, by th. 
preMnc of the L.cZ gene. 

M,.. 3X to 3F [SEQ ID »o: U report the 
strand of the doubl.-.tr.nded pl«.id pM4.CMVL.eZ. The 
c!^le.ent.ry .equenc. «y be readily obtained by on. of 

..iU in the art. «^«.=. /"""^^o. 

cponenf. 5- »d IT« (nucleotid.. /' 
1) , th. 5- «d ITR (nucleotide. 5496-5144 of SEQ ID HO. 
I CK» pro.»ter/«*»«»r (nucleotide. 5117-45S4 of SM 
ID hTiK SD/SX sequence (nucleotide. 4507-4376 of SM 
S Z- i l-cz ,ene (nucleotides 43.0-845 of SM ID »0, 
ai^" poly » «qu«.ce (nucleotide. .37-63, of SEQ ID 

Fig 4A is a schematic of shuttle vector 
pAdAc.CMVLacZ containing an Ad5 5- ITR and 3' ^'^ 
positioned head-to-tail, with a CHV -"^"--^-"f^^^;^ 
Lcz minigene immediately following the 5- ^J^'J^^^^ 
by a plasmid pSP72 (Promega) baclcbone. Restriction 
endonuclease enzymes are represented by conventional 

designations in the plasmid constructs. 

Fig 4B is a schematic depiction of the function of 

, the vector system Of Fig. 4A. ^» ^"'"^ 
virus Ad.CBhpAP. the circular pADAc.CMVLacZ shuttle 
vector sequence is packaged into virion heads, 
Tstinguilbl. from the helper virions by the presence 
of the LacZ gene. 



35 
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Figs. 5A to SP [SEQ ID NO: 2] report the top DNA 
strand of the doxible-stranded vector pAdAc.GMVLacZ. The 
complenentary eeguence may be readily obtained by one of 
skill in the art* The sequence includes the following 
5 components: 5* Ad ITR (nuclectides 600-958 of SEQ ID NO: 
2) ; C3fV promoter/enhancer (nucleotides 969-1563 of SEQ ZD 
NO: 2); SD/SA sequence (nucleotides 1579-1711); LacZ gene 
(nucleotides 1762-5236 of SEQ ID NO: 2); poly A sequence 
(nucleotides 5245-5443 of SEQ ID NO: 2); and 3V Ad ITR 
10 (nucleotides 16-596 of SEQ ID NO: 2). 

Fig. 6 is a schematic of shuttle vector pAdA.CBCFTR 
containing 5" ITR from Ad5, followed by a chimeric CHV 
enhancer/ fi act in promoter enhancer, a CFTR gene, a poly-A 
sequence, a 3* ITR from Ad5, and remaining plasmid 
15 sequence from plasmid pSLllSO (Pharmacia) backbone. 
Restriction endonuclease enzymes are represented by 
conventional designations in the plasmid constructs. 

Figs. 7A to 7H [SEQ ID NO: 3] report the top DNA 
strand of the double-stranded plasmid pAdA.CBCFTR. The 
20 complementary sequence may be readily obtained by one of 
skill in the art. The sequence includes the following 
components: 5* Ad ITR (nucleotides 9611-9254 of SEQ ID 
NO: 3); chimeric CMV enhancer /fi actin promoter 
(nucleotides 9241-8684 of SEQ ID NO; 3) ; CFTR gene 
25 (nucleotides 8622-40iS5 of SEQ ID NO: 3); poly A sequence 
(nucleotides 3887-3684 of SEQ ID NO: 3); and 3" Ad ITR 
(nucleotides 3652-3073 of SEQ ID NO: 3). The remaining 
plasmid backbone is obtained from pSLllSO (Pharmacia) • 

Fig. 8A illustrates the generation of 5* adenovirus 
30 terminal sequence that contained PAC domains I and II by 
PCR. See, arrows indicating righthand and lefthand (PAC 
II) PGR probes in Fig. IB. 
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rig. 8B illustrates the generation of 5- terminal 
sequence that contained PAC domains 1. H, I" 
PCR. see, arrows indicating righthand and lefthand (PAC 

IV) PCR probes in Fig* 1B# ^ , 

5 Pig! 8C depicts the amplification products Bubcloned 

into the multiple cloning site of pAd.LinX.l (IHGT Vector 
core) generating pAd.PACII (domains I and II) and 
pAd.PACiV (domains I, II, HI, and IV) resulting in 
crippled helper viruses, Ad.PACII and Ad.PAClV with 
10 modified packaging (PAC) signals. 

Fig. 9A is a schematic representation of the 
subcloning of a human placenta alkaline phosphatase 
reporter minigene containing the immediate early CMV 
enhancer/ promoter (CMV) , human placenta alkaline 
phosphatase cDHA (hpAP) , and SV40 polyadenylation signal 
(PA) into PAd.PACII to result in crippled helper virus 
vector pAdA.PACII.CMVhpAP. Restriction endonuclease 
enzymes are represented by conventional designations in 
the plasmid constructs. 

Fig. 9B is a schematic representation of the 
subcloning of the same minigene of Fig. 9A into pAd.PACIV 
to result in crippled helper virus vector 
pAd.PACIV. CMV.hpAP. 

Fig 10 is a flow diagram summarizing the synthesis 
of an adenovirus-based polycation helper virus conjugate 
and its combination with a pAdA shuttle vector to result 
in a novel viral particle complex. CsCl band P-^ified 
helper adenovirus was reacted with the heterobifunctional 
crosslinker sulfo-SMCC and the capsid protein fiber is 
labeled with the nucleophilic maleimide moiety. Free 
sulfhydryls were introduced onto poly-L-lysine using 2- 
iHinothiolane-HCl and mixed with the labelled adenovirus, 
resulting in the helper virus conjugate Ad-pLys. A 
unique adenovirus-based particle is generated by 
purifying the Ad-pLys conjugate over a CsCl gradient to 



20 



25 



30 
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remove unincorporated poly-L-lysine^ followed by 
extensively dialyzing, adding shuttle plasmid DNAs to Ad- 
pLys and allowing the complex formed by the shuttle 
plasmid wrapped around Ad-pLys to develop. 
5 Fig. 11 is a schematic diagram of pCCL-DMD, which is 

described in detail in Example 9 below. 

Fig. 12 A - 12P provides the continuous DNA sequence 
of pAdA.CNVmDys [SEQ ZD NO:10]. 

10 Detailed Description of the Invention 

The present Invention provides a unique recombinant 
adenovirus capable of delivering transgenes to target 
cells, as well as the components for production of the 
unique virus and methods for the use of the virus to 

15 treat a variety of genetic disorders. 

The AdA virus of this invention is a viral particle 
containing only the adenovirus cis-elements necessary for 
replication and virion encapsidatioh (i.e., ITRs and 
packaging sequences) , but otherwise deleted of all 

20 adenovirus genes (i.e., all viral open reading frames). 

This virus carries a selected transgene under the control 
of a selected promoter and other conventional regulatory 
components, such as a poly A signal. The AdA virus is 
characterized by improved persistence of the vector DNA 

25 in the host cells, reduced antigenicity/ immunogenicity, 
and hence, improved performance as a delivery vehicle. 
An additional advantage of this invention is that the AdA 
virus permits the packaging of very large transgenes, 
such as a full-length dystrophin cDNA for the treatment 

30 of the progressive wasting of muscle tissue 

characteristic of Duchenne Muscular Dystrophy (DMD) • 

This novel recombinant virus is produced by use of 
an adenovirus-based vector production system containing 
two components: 1) a shuttle vector that comprises 

35 adenovirus cis-elements necessary for replication and 
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virion encapsidation and is deleted of all -ral gen^. 
Which vector carries a reporter or therapeuticjainigene 
and 2) a helper adenovirus which, alone or with a 
packaging cell line, i. capable of providing all of the 

5 viral gene products necessary xor a productive viral 
infection when co-transf ected with the shuttle vector 
preferably, the helper virus is modified so that it does 
not package itself efficiently. In thxs f 
desirably used in combination with a packaging <^-^^J^^ 

,0 that stably expresses adenovirus genes. ^« ^^^^^^^ 
producing this viral vector froB these components include 
both a novel means of packaging of an 

adenoviral/transgene containing vector into a virus, and 
a novel method for the subsequent separation of the 
15 helper virus from the newly formed recombinant vxrus. 



20 



X. The Shuttle veeter 

The Shuttle vector, referred to as pAdA, is composed 
of adenovirus sequences, and transgene sequences, 
including vector regulatory control sequences. 



Th» ad«noviru. nuolelc add uqurnxB of th. 
Shuttle v«=t«: provld. th. ilnlm- .d«.<.viru. .^«.c» 
Which .n.bl. . viral P«:ti=l. to b. produc«> wx^ th. 
„ ..siBt.no. Of a h.lp« viru.. "'""r'VlIr-t 
delivery of a r««>itoii>ant trM«g«« genome to a target 
cll W the resulting recombinant virus. 

The DHA sequence, of a nu.b«r of adenovirus 

types are available from Genbank. including type »d5 
,0 t^anx Accession NO. M„a6.,. •»» ^^"^ "^^^ 
U be Obtained f roB any to»n, adenovirus serotype, ^ 
a. serotype. 2, 3, 4, 7, 12 and «0, furthsr including 
any of th. pr«ently Idwitifled 41 huMn types I«e, 
e.g., Horvit., cit«l abov.). Similarly •^"'"'f™'" 
„ JoJn to infect other ani«l. ~y .1" " "P^'ved xn the 
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vector constructs of this Invention. The selection of 
the adenovirus type is not anticipated to limit the 
following invention. A variety of adenovirus strains are 
available from the American Type Culture Collection, 
5 Roclcville, Maryland, or availai^le by request from a 

variety of commercial and institutional sQurces. In the 
following exemplary embodiment an adenovirus, type 5 
(Ad5) is used for convenience. 

However, it is desirable to obtain a variety of 

10 pAdA shuttle vectors based on different human adenovirus 
serotypes. It is anticipated that a library of such 
plasmids and the resulting AdA viral vectors would be 
useful in a therapeutic regimen to evade ceillular, and 
possibly humoral, ixcoaunity, and lengthen the duration of 

15 transgene expression, as well as improve the success of 
repeat therapeutic treatments. Additionally the use of 
various serotypes is believed to produce recombinant 
viruses with different tissue targeting specificities. 
The absence of adenoviral genes in the AdA viral vector 

20 is anticipated to reduce or eliminate adverse CTL 

response which normally causes destruction of recombinant 
adenoviruses deleted of only the El gene. 

Specifically, the adenovirus nucleic acid 
sequences employed in the pAdA shuttle vector of this 

25 invention are adenovirus genomic sequences from which all 
viral genes are deleted. More specifically, the 
adenovirus secpiences employed are the cis<-acting 5* and 
3* inverted terminal repeat (ITR) sequences of an 
adenovirus (which function as origins of replication) and 

30 the native 5* packaging/enhancer domain, that contains 
sequences necessary for packaging linear Ad genomes and 
enhancer elements for the £1 promoter. These seq[uences 
are the sequences necessary for replication and virion 
encapsidation. See, e.g., P. Hearing et al, J. Virol.. 

35 £1(8) :2555-2558 (1987); M. Grable and P. Hearing, JL. 
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10 



15 



20 



25 



30 



YirsO./ fil(5) : 2D47-2056 (1990); and M. Grable and P. 
Hearing, i^jOXfil., ££(2) :723-731 (1992). 

According to this invention, the entire 
adenovirus 5' sequence containing the 5- ITR and 
packaging/enhancer region can be employed a. «»e 5^ 
adenovirus sequence in the pAdA shuttle vector. This 
left terminal (5-) sequence of the Ad5 genome useful xn 
this invention spans bp 1 to about 360 of the 
conventional adenovirus genome, also referred to as map 
units 0-1 of the viral genome. This sequence xs provided 
herein as nucleotides 5496-5144 of SEQ ID HO: 1, 
nucleotides 600-958 of SEQ ID HO: 2, and 
96ll-9254 of SEQ ID HO: 3, and generally is from about 
353 to about 360 nucleotides in length. This sequence 
includes the 5- ITR (bp 1-103 of the •<>7f ' 
and the packaging/enhancer domain (bp 194-358 of the 
adenovirus genome). See, Figs. lA, 3, 5. ]' 

preferably, this native adenovirus 5* region is 
employed in the shuttle vector in unmodified form. 
However, some modifications including deletions, 
"itions and additions to this sequence «hi<* do not 
adversely effect its biological function may be 

atrprable. See, e.g., WO 93/24641, P""^*^ --"^^ 
9 1993. The ability to modify these ITR sequences is 

within the ability of one of skill in the art. See 

e g., texts such as Sambrook et al. -Molecular Cloning. 

;Lboratory Manual.-, 2d edit.. Cold ^P^^^"^ "-^"'^ 

Laboratory, Cold Spring Harbot, New York 

The 3« adenovirus sequences of the shuttle 
vector include the right terminal (3') I™ '^^^^^^^l^ 
the adenoviral genome spanning about bp 35,353 - end 
the adenovirus genome, or map units "^^-^-^^O; This 
sequence is provided herein as nucleotides 607-28 of SEQ 
ID ho: 1, nucleotides 16-596 of SEQ ID HO: 2; and 
, "cleotides 3652-3073 of SEQ ID HO: 3, and generally is 
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about 580 nucleotlcles in length. This entire sequence is 
desirably employed as the 3* sequence of an pAdA shuttle 
Victor. Preferably, the native adenovirus 3* region is 
employed in the shuttle vector in unmodified form. 
5 However, some modifications tc this sequence which do not 
adversely effect its biological ftinction may be 
acceptable. 

An exemplary pAdA shuttle vector of this 
invention, described below and in Fig. 2A, contains only 

10 those adenovirus seqniences recjulred for packaging 

adenoviral genomic DNA into a preformed capsid head. The 
pAdA vector contains Ad5 sequences encoding the 5' 
terminal and 3* terminal sequences (identified in the 
description of Fig. 3) , as well as the transgene 

15 sequences described below. 

From the foregoing information, it is expected 
that one of skill in the art may employ other equivalent 
adenovirus sequences for use in the AdA vectors of this 
invention. These sequences may include other adenovirus 

20 strains, or the above mentioned cis-acting sequences with 
minor modifications. 

The transgene sequence of the vector and 
recombinant virus is a nucleic acid sequence or reverse 
25 transcript thereof, heterologous to the adenoviinis 
sequence, which encodes a polypeptide or protein of 
interest. The transgene is operatively linked to 
regulatory components in a manner which permits transgene 
transcription. 

30 The composition of the transgene sequence will 

depend upon the use to which the resulting virus will be 
put. For example, one type of transgene sequence 
includes a reporter sequence, which upon expression 
produces a detectable signal. Such reporter sequences 

35 include without limitation an E. coll beta-galactosidase 
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(LacZ) cDNA, a human placental alkaline phosphatase gene 
and a green fluorescent protein gene. These sequences, 
when associated with regulatory elements which drive 
their expression, provide signals detectable by 
5 conventional means, e.g., ultra/iolet wavelength 
absorbance, visible color change, etc. 

Another type of transgene sequence includes a 
therapeutic gene which expresses a desired gene product 
in a host cell. These therapeutic nucleic acid sequences 
10 typically encode products for adninistration and 

expression in a patient in vivo or ex vivo to replace or 
correct an inherited or non-inherited genetic defect or 
treat an epigenetic disorder or disease. Such 
therapeutic genes which are desirable for the performance 
of gene therapy include, without limitation, a normal 
cystic fibrosis transmembrane regulator (CFTR) gene (see 
Pig. 7) , a low density lipoprotein (LDL) gene [T. 
yamamoto et al, Qsll, 2a:27-28 (November, 1984)], a DMD 
CDNA sequence [partial sequences available from GenBank, 
Accession Nos. M36673, M36671, [A. P. Monaco et al, 
HatUEfi, 121:646-650 (1986)] and L06900, [Roberts et al, 
^.S^^jS^. 2:293-299 (1993)]] (GenbanJc) , and a number 
of genes which may be readily selected by one of skill in 
the art. The selection of the transgene is not 
considered to be a limitation of this invention, as such 
selection is within the knowledge of the art-skilled. 
p^rpilfttory moments 

in addition to the major elements identified 
above for the pAdA shuttle vector, i.e., the adenovirus 
sequences and the transgene, the vector also includes 
conventional regulatory elements necessary to drive 
expression of the transgene in a cell transfected with 
the pAdA vector. Thus the vector contains a selected 
promoter which is linked to the transgene and located. 



20 



25 



30 



35 



wo 96/13597 PCT/US9S/14017 



15 

with the transgene, between the adenovirus sequences of 
the vector. 

Selection of the prbnoter is a routine natter 
and is not a limitation of the pAdA vector itself. 
5 Useful promoters may be const i cut ive promoters or 
riegulated (inducible) promoters, which vill enable 
control of the amount of the transgene to be expressed. 
For example, a desirable promoter is that of the 
cytomegalovirus immediate early promoter/ enhancer [see, 

10 e.g., Boshart et al. Cell , 41:521-530 (1985)]. This 

promoter is found at nucleotides 5117-4524 of SEQ ID NO: 
1 and nucleotides 969-1563 of SEQ ID NO: 2. Another 
promoter is the CMV enhancer/ chicken B-actin promoter 
(nucleotides 9241-8684 of SEQ ZD NO: 3). Another 

15 desirable promoter includes, without limitation, the Rous 
sarcoma viziis LTR promoter/ enhancer. Still other 
promoter/ enhancer sequences may be selected by one of 
skill in the art. 

The shuttle vectors will also desirably contain 

20 nucleic acid sequences heterologous to the adenovirus 

sequences including sec^iences providing signals required 
for efficient polyadenylation of the transcript and 
introns with functional splice donor and acceptor sites 
(SD/SA) . A common poly-A sequence which is employed in 

25 the exemplary vectors of this invention is that derived 
from the papovavirus SV-40 [see, e.g., nucleotides 837- 
639 of SEQ ID NO: 1; 5245-5443 of SEQ ID NO: 2; and 3887- 
3684 Of SEQ ID NO: 3]. The pbly-A sequence generally is 
inserted in the vector following the transgene sequences 

30 and before the 3* adenovirus sequences. A common intron 
sequence is also derived from SV-40, and is referred to 
as the SV-40 T Intron sequence [see, e.g., nucleotides 
4507-4376 of SEQ ID NO: 1 and 1579-1711 of SEQ ID NO: 2). 
A pAdA shuttle vector of the present invention may also 

35 contain such an intron, desirably located between the 
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proBoter/enhancer sequence and the transgene. Selection 
of these and other common vector elements are 
conventional and many such sequences are available [see, 
e.g., Sambrook et al, and references cited therein]. 
5 Examples of such regulatory sequences for the above are 
provided in the plasmid sequences of Figs. 2, 5 and 7. 

The combination of the transgene, promoter/ 
enhancer, the other regulatory vector elements «« 
referred to as a -minigene- for ease of reference herein. 
10 The minigene is preferably flanked by the 5- and 3 cxs- 
acting adenovirus sequences described above. Such a 
minigene may have a size in the range of several hundred 
base pairs up to about 30 kb due to the absence of 
adenovirus early and late gene sequences in the vector. 
15 Thus, this AdA vector system permits a great deal of 
latitude in the selection of the various components of 
the minigene, particularly the selected transgene with 
regard to size. Provided with the teachings of this 
invention, the design of such a minigene can be made by 
20 resort to conventional techniques. 

II. The Helper virus 

Because of the limited amount of adenovirus sequence 

present in the AdA shuttle vector, a helper ^^^-o^^^^^ 
25 Lis invention must, alone or in concert with a packagxng 
cell line, provide sufficient adenovirus gene sequences 
necessary for a productive viral infection. Helper 
viruses useful in this invention thus contain selected 
adenovirus gene sequences, and optionally a second 

" ""Trmlur- Proauction of a recombinant adenovirus 
which utilizes helper adenovirus containing a full 
complement of adenoviral genes results in recombinant 
virus contaminated by excess production of the ^^^^^^ 

35 virus. Thus, extensive purification of the viral vector 
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from the contaminating helper virus is required. 
However, the present invention provides a way to 
facilitate purification and reduce contamination by 
crippling the helper virus. 
5 One preferred embodiment of a helper virus of this 

invention thus contains three components (A) 
modifications or deletions of the native adenoviral gene 
sequences which direct efficient packaging, so as to 
substantially disable or "cripple" the packaging function 

10 of the helper virus or its ability to replicate, (B) 

selected adenovirus genes and (C) an optional reporter 
minigene. These "crippled" helper viruses may also be 
formed into poly-cation conjugates as described below. 
The adenovirus sequences forming the helper virus 

15 may be obtained from the sources identified above in the 
discussion of the shuttle vector.. Use of different Ad 
serotypes as helper viruses enables production of 
recombinant viruses containing the AAd (serotype 5) 
shuttle vector sequences in a capsid formed by the other 

20 serotype adenovirus. These recombinant viruses are 

desirable in targeting different tissues, or evading an 
immxme response to the ^Ad sequences having a serotype 5 
capsid. Use of these different Ad serotype helper 
viruses may also demonstrate advantages in recombinant 

25 virus production, stability and better packaging. 
Ai. Th9 Crippling ffc>tfifi<yfftiCTP 

A desirable helper virus used in the production 
of the adenovirus vector of this invention is modified 
(or crippled) in its 5* ITR packaging/enhancer domain, 
30 Identified above. As stated above, the 

packaging/ enhancer region contains sequences necessary 
for packaging linear adenovirus genomes ("PAC" 
sequences) • Hore specifically, this sequence contains at 
least seven distinct yet functionally redundant domains 
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that are required for efficient encapsidation of 

replicated viral DHA. 

Within a stretch of nucleotide sequence from bp 
194-358 of the Ad5 genome, five of these so-called A- 
repeats or PAC sequences are Iccalized (see. Fig. IB) . 
PAC I is located at bp 241-248 of the adenovirus genome 
(on the strand complementary to nucleotides 5259-5246 of 
SEQ ID no: I) . PAC II is located at bp 262-269 of the 
adenovirus genome (on the strand complementary to 
nucleotides 5238-5225 of SEQ ID NO: 1) . PAC III is 
located at bp 304-311 of the adenovirus genome (on the 
strand complementary to nucleotides 5196-5183 of SEQ ID 
NO: 1). PAC IV is located at bp 314-321 of the 
adenovirus (on the strand complementary to nucleotides 
5186-5172 of SEQ ID NO: 1) . PAC V is located at bp 339- 
346 of the adenovirus (on the strand coii«>lementary to 
nucleotides 5171-5147 of SEQ ID NO: 1) . 

Corresponding sequences can be obtained from 
SEQ ID no: 2 and 3. PACI is located at nucleotides 837- 
851 of SEQ ID no: 2; and on the strand complementary to 
nucleotides 9374-9360 of SEQ ID NO: 3. PAC II is located 
at nucleotides 859-863 of SEQ ID NO: 2; and on the strand 
complementary to nucleotides 9353-9340 of SEQ ID NO: 3. 
PAC III is located at nucleotides 901-916 of SEQ ID NO: 
2; and on the strand complementary to nucleotides 9311- 
9298 of SEQ ID NO: 3. PAC IV is located at nucleotides 
911-924 of SEQ ID NO: 2; and on the strand complementary 
to nucleotides 9301-9288 of SEQ ID NO: 3. PAC V is 
located at nucleotides 936-949 of SEQ ID NO: 2; and on 
the strand complementary to nucleotides 9276-9263 of SEQ 
ID NO: 3. 
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Table 1 below lists these five native Ad5 
sequences and a consensus PAC sequence based on the 
similarities between an eight nucleic acid stretch within 
the five sequences. The consensus sequence contains two 
positions at which the nucleic acid may be A or T (A/T) • 
The conventional single letter designations are used for 
the nucleic acids, as is known to the art. 

Table 1 



Adenovirus Genome 
Base Pair Nos. & 
ft-pepe^i; ^Wleptid? pequenge 

15 241 248 

I TAG TAAATTTG G6C [SEQ ID NO: 4] 

262 269 

II AGT AAGATTTG GCC [SEQ ID NO: 5] 



304 311 

III AGT GAAATCTG AAT [SEQ ID NO: 6] 



314 321 

25 IV GAA TAATTTTG T6T [SEQ ID NO: 7] 

339 346 

V CGT AATAtTTG TCT [SEQ ID NO: 8] 

30 Consensus 5* (A/T)AN(A/T)TTTG 3* [SEQ ID NO: 9] 

According to this invention, mutations or 
deletions may be made to one or more of these PAC 
secjuences to generate desirable crippled helper viruses. 

35 A deletion analysis of the packaging domain revealed a 

positive correlation between encapsidation efficiency and 
the number of packaging A-^repeats that were present at 
the 5* end of the genome. Modifications of this domain 
may include 5' adenovirus sequences which contain less 

40 than all five of the PAC sequences of Table 1. For 
example, only two PAC secpiences may be present in the 
crippled virus, e.g., PAC I and PAC II, PAC III and PAC 
IV, and so on. Deletions of selected PAC sequences may 
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involve deletion of contiguous or non-contiguous 
sequences. For exeunple, PAC II and PAC IV »ay be 
deleted, leaving PAC I, HI and IV in the 5* sequence. 
Still an alternative modification may be the replacement 
of one or more of the native P*.C sequences with one or 
more repeats of the consensus sequence of Table 1. 
Alternatively, this adenovirus region may be modified by 
deliberately inserted mutations which disrupt one or more 
of the native PAC sequences. One of skill in the art may 
further manipulate the PAC sequences to similarly achieve 
the effect of reducing the helper virus packaging 
efficiency to a desired level. 

Exemplary helper viruses which involve the 
manipulation of the PAC sequences described above are 
disclosed in Example 7 below. Briefly, as described in 
that example, one helper virus contains in place of the 
native 5« ITR region (adenovirus genome bp 1-360). a 5 
adenovirus sequence spanning adenovirus genome bp 1-269, 
which contains only the 5' ITR and PAC I and PAC II 
sequences, and deletes the adenovirus region bp 270-360. 

Another PAC sequence modified helper virus 
contains only the 5' Ad5 sequence of the ITR and PAC I 
through PAC IV (Ad bp 1-321) , deleting PAC V and other 
sequences in the Ad region bp322-360. 

These modified helper viruses are characterized 
by reduced efficiency of helper virus encapsidation. 
These helper viruses with the specific modifications of 
the sequences related to packaging efficiency, provide a 
packaging efficiency high enough for generating 
production lots of the helper virus, yet low enough that 
they permit the achievement of higher yields of AdA 
transducing viral particles according to this invention. 
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The S3lectBd AdBnovlrus GenBs 
Helper viruses useful in this invention, 
whether or not they contain the "crippling" modifications 
described above, contain selected adenovirus gene 
5 sequences depending upon the call line which is 

transfected by the helper virus and shuttle vector. A 
preferred helper virus contains a variety of adenovirus 
genes in addition to the modified sequences described 
above. 

10 As one example, if the cell line employed to 

produce the recombinant virus is not a packaging cell 
line, the helper virus may be a wild type Ad virus. 
Thus, the helper virus supplies the necessary adenovirus 
early genes El, E2, E4 and all remaining late, 

15 intermediate, structural and non-structural genes of the 
adenovirus genome. This helper virus may be a crippled 
helper virus by incorporating modifications in its native 
5* packaging/ enhancer domain. 

A desirable helper virus is replication 

20 defective and lacks all or a sufficient portion of the 

adenoviral early immediate early gene Ela (which spans mu 
1.3 to 4.5) and delayed early gene £lb (which spans mu 
4.6 to 11.2) so as to eliminate their normal biological 
functions. Such replication deficient viruses may also 

25 have crippling modifications in the packaging/enhancer 
domain. Because of the difficulty surrounding the 
absolute removal of adenovirus from AdA preparations that 
have been enriched by CsCl buoyant density 
centrifugation, the use of a replication defective 

30 adenovirus helper prevents the introduction of infectious 
adenovirus for in vivo animal studies. This helper virus 
is employed with a packaging cell line which supplies the 
deficient £1 proteins, such as the 293 cell line. 
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Additianally, all or a portion of the 
adenovirus delayed early gene E3 (which spans mi 76.6 to 
86.2) »ay be eliminated fro. the adenovirus '•^^^ 
Which forms a part of the helper viruses --^^^^^^^f 
invention, without adversely affecting the unction of 
the helper virus because this gene product is not 
necessary for the formation of a functioning vxrus 

in the presence of other packaging cell lines 
Which are capable of supplying adenoviral P-^*^^^'^ 
addition to the El, the helper virus may accordingly be 
debited of the genes encoding these adenoviral proteins. 

such additionally deleted helper vi-ses also desirably 
contain crippling modifications as described above. 

It is also desirable for the helper virus to 
contain a reporter minigene. in which the reporter gene 
is desirably different from the reporter transgene 
contained in the shuttle vector. A number of such 
reporter genes are )cnown, as -^-"^ 
presence of a reporter gene on the ^ 
different from the reporter gene on the pAdA. allows both 
the recombinant AdA virus and the helper virus to be 
Independently monitored. For example, the expression of 
recombinant alkaline phosphatase enables 
cjuantities of contaminating adenovirus to be monitored 
dependent of recombinant LacZ expressed by an pAdA 
shuttle vector or an AdA virus. 

^ ff^Tp^r Vnlvrntinn rpniufffltgg 

Still another method for reducing the 
contamination of helper virus involves the formation of 
poly-cation helper virus conjugates, which may be 
Lsociated with a plasmid containing other adenoviral 
genes, which are not present in the helper -rus. The 
helpe; viruses described above may be further modified by 
i resort to adenovirus-polylysine conjugate technology. 
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See, e.g., Wu et al, J, Biol. Chein> . 2M: 16985-16987 
(1989); and K. J. Fisher and J. H* Wilson, Bioehem, J, . 
222: 49 (April l, 1994), incorporated herein by 
reference. 

5 Using this technology, a helper virus 

containing preferably the late adenoviral genes is 
modified by the addition of a poly-cation secpience 
distributed around the capsid of the helper virus. 
Preferably, the poly-cation is poly-lyslne, which 

10 attaches around the negatively-charged vector to form an 
external positive charge. A plasmid is then designed to 
express those adenoviral genes not present in the helper 
virus, e.g., the El, E2 and/ or E4 genes. The plasmid 
associates to the helper virus-conjugate through the 

15 charges on the poly-lysine sequence. This modification 
is also desirably made to a crippled helper virus of this 
invention. This conjugate (also termed a trans- infection 
particle) permits additional adenovirus genes to be 
removed from the helper virus and be present on a plasmid 

20 which does not become Incorporated Into the virus during 
production of the recombinant viral vector* Thus, the 
Impact of contamination is considerably lessened. 

IZX. Assembly of Shuttle Veotor, Helper Virus and 

25 Production of Reooad^inaiit Virus 

The material from which the sequences used in the 
pAdA shuttle vector and the helper viruses are derived, 
as well as the various vector - components and sequences 
employed in the construction of the shuttle vectors, 

30 helper viruses, and AdA viruses of this Invention, are 
obtained from commercial or academic sources based on 
previously published and described materials. These 
materials may also be obtained from an individual patient 
or generated and selected using standard recombinant 

35 molecular cloning techniques known and practiced by those 
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sHilled in the a.t. Any Bodification J^^^*^^; 
acid sequences fonaing the vectors and viruses, including 
sequence deletions, insertions, and other mutations are 
also generated using standard techniques. 

Assembly of the selected DJA sequences of the 
adenovirus, and the reporter genes or therapeutic genes 
and other vector elements into the pAdA shuttle vector 
using conventional techniques is described in Example 1 
below, such techniques include conventional cloning 
techniques of cDNA such as those described in texts 
rsambrook et al, cited above], use of overlapping 
oligonucleotide sequences of the ad«»ovirus genomes, 
pol^erase chain reaction, and any suitable method which 
provides the desired nucleotide sequence. Standard 
Lansfection and co-transfection "l^^^^"*' 
e.g., CaPO, transfection techniques using the HEK 293 
cell line. Other conventional methods employed in this 
invention include homologous recombination of the viral 
genomes, plaguing of viruses in agar overlay, ^^^^^^^ 
^lasuriig signal generation, and the li.e. ^sse-b^ of 
any desired Ad. vector or helper virus of this i--^^^ 
is within the skill of the art, based on the teachings of 
this invention. 

AS described in detail in Example 1 below and 
with resort to Fig. 2A and the DMA sequence of the 
Tllid reported in Fig. 3, a unique pAdA shuttle vector 
of this invention, pAdA . CMVLacZ , is generated. 
pAdA.CMVLacZ contains Ad5 sequences encoding «»e 5 
terminal followed by a CMV promoter /enhancer, a splice 
donor/splice acceptor sequence, a bacterial beta- 

iT.ez^ a SV-40 poly A sequence (pA) , 
galactosidase gene (LacZ) , a sv f j 

a 3. ITR from Ad5 and remaining plasmid sequence from 
plasmid PSP72 (Promega) backbone. 
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To generate the AdA genome which Is 
incorporated in the vector, the plasmid pAdA.CMVLacZ must 
be must be digested with EcoRI to release the AdA • CHVXiacZ 
genome, freeing the adenovirus ITRs and making them 
5 available targets for replication. Thus production of 
the vector is "restriction-dependent", i.e., requires 
restriction endonuclease rescue of the replication 
template. See, Fig. 2B. 

A second type of pAdA plasmid was designed 

10 which places the 3* Ad terminal sequence in a head-to- 
tail arrangement relative to the 5* terminal sequence. 
As described in Example 1 and Figs. 4A, and with resort 
to the DNA sequence of the plasmid reported in Fig. 5, a 
second unique AdA vector sequence of this invention, 

15 AdAc.CMVLacZ, iis generated from the shuttle plasmid 

pAdAc.CHVLacZ, which contains an Ad5 5' ZTR sequence and 
3* ITR sequence positioned head-to-tail, followed by a 
CMV enhancer/ promoter, SD/SA sequence, LacZ gene and pA 
sequence in a plasmid pSP72 (Promega) backbone. As 

20 described in Example IB, this restrict ion- independent** 
plasmid permits the AdA genome to be replicated and 
rescued from the plasmid backbone without including an 
endonuclease treatment (see. Fig. 4B) . 
£4. Helper Virus 

25 As described in detail in Exaaple 2, an 

exemplary conventional Ei deleted adenovirus helper virus 
is virus Ad.CBhpAP, which contains a 5' adenovirus 
sequence from mu 0-1, a reporter minigene containing 
human placenta alkaline phosphatase (hpAP) under the 

30 transcriptional control of the chicken fl-actin promoter, 
followed by a poly-A sequence from SV40, followed by 
adenovirus sequences from 9.2 to 78.4 and 86 to 100. 
This helper contained deletions from mu 1.0 to 9.2 and 
78.4 to 86, which eliminate substantially the El region 

35 and the E3 region of the virus. This virus may be 
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desirably crippled according to this invention by 
modifications to its packaging enhancer dorB^in. 

Exemplary crippled helper viruses of this 
invention are described using the techniques described 
5 Example 7 and contain the Bodiaed 5- -egu^ces, 
i.e., adenovirus genome bp 1-269; ».u. 0'0,^^ or 
adenovirus genome bp 1-321; «.u. 0-0.89. B"-"^' ' 
ttZnces L modified by PCR and cloned by conventional 
tl^i^es into a conventional adenovirus P^-^^' 
.0 r^^inigene is incorporated into "^e plasm^. 

is Ln altered by homologous recombination wi^ an E3 
deleted adenovirus dl7001 to result in the 
vectors so that the reporter minigene is followed nj^^ 
3. end with the adenovirus sequences mu 9.6 to 78.3 ana 

" " ""oeneration of a poly-L-lysine -.-ate -^^^^^ 
Virus was demonstrated essentially as ^-^^^^^^".^^^^^^^^ 
in Example 5 below and Fig. 10 by coupling P-^^-^^^^^ 
To the L.CBhpAP virion capsid. -^-rnatively t^ same 

20 procedure may be employed with the PAC sequence modified 
helper viruses of this invention. 

AS stated above, a pAdA shuttle vector in the 

presence of helper virus and/or a ^^^'^ 
« permits the adenovirus-transgene sequences in the shuttle 
V^^^Tle replicated and packaged into virion capsxds, 
ZiZ in the recombinant AdA virus. The current 

::rd7ort::cing such AdA virus is 

descrlb.a in detail m BcaBple 3. BrUfly. helper 

rjrr.. « ^r.,^ =.ii.. ^ - --^^ 

V HFK 293 which are then subsequently 
line human HEK wuiwi* » 

trans£.ct.d «ith «. pA^ .huttl. vector conf^g a 
.elected transgene W conventional «tl,od.. About 30 
::re nour. po.t-tran.fe=ti.n, the ells are harvested, 
,S and an extract prepare.. Th. «U viral geno.. x. 
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packaged into virions that sediment at a lower density 
than the helper virus in cesium gradients. Thus, the 
recombinant AdA virus containing a selected transgene is 
separated from the bulk of the helper virus by 
5 purification via buoyant density ultracentrifugation in a 
CsCl gradient. 

The yield of AdA transducing virus is largely 
dependent on the number of cells that are trans fected 
with the pAdA shuttle plasmid, making it desirable to use 

10 a transf action protocol with high efficiency. One such 
method involves use of a poly-L-lysinylated helper 
adenovirus as described above. A pAdA shuttle plasmid 
containing the desired transgene under the control of a 
suitable promoter, as described alsove, is then complexed 

15 directly to the positively charged helper virus .caps id, 
resulting in the formation of a single transfection 
particle containing the pAdA shuttle vector and the 
helper functions of the helper virus. 

The underlying principle is that the helper 

20 adenovirus coated with plasmid pAdA DNA will co-transport 
the attached nucleic acid across the cell membrane and 
into the cytoplasm according to its normal mechanism of 
cell entry. Therefore, the poly-L-lysine modified helper 
adenovirus assumes multiple roles in the context of an 

25 AdA-based complex. First, it is the structural 

foundation upon which plasmid DNA can bind increasing the 
effective concentration. Second, receptor mediated 
endocytosis of the virus provides the vehicle for cell 
uptake of the plasmid DNA. Third, the endosomalytic 

30 activity associated with adenoviral infection facilitates 
the release of internalized plasmid into the cytoplasm. 
And the adenovirus contributes trans helper functions on 
which the recombinant AdA virus is dependent for 
replication and packaging of transducing viral particles. 

35 The Ad-based transfection procedure using an pAdA shuttle 
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vector and a polycation-helper conjugate 
Example 6. Additionally, as described previously, the 
helper virus-plasmid conjugate may be another for» of 
helper virus delivery of the omitted adenovirus genes ^t 
5 present in the pAdA vector, such a structure enables the 
rest of the required adenovirus genes to be divided 
between the plasmid and the helper virus, thus reducing 
the self-replication efficiency of the helper virus. 

A presently preferred method of producing the 
10 recombinant AdA virus of this invention involves 

performing the above-described transfection with the 
crippled helper virus or crippled helper virus conjugate, 
as described above. A "crippled- helper virus of this 
invention is unable to package itself efficiently, and 
15 therefor permits ready separation of ^^^^^^^^^^^^^ 
from the newly packaged AdA vector of this invention by 
use of buoyant density ultracentrifugation in a CsCi 
gradient, as described in the examples below. 

20 IV. Function of the Recombinant AdA Virus 

once the AdA virus of this invention is produced by 

cooperation of the shuttle vector and helper ^iru.. the 
AdA virus can be targeted to, and taken up by, a selected 
target cell. The selection of the target cell also 
25 depends upon the use of the recombinant ^'•''^.^^ 
Whether or not the transgene is to be replicated m v.tro 
or BX vivo for production in a desired cell type for 
redelivery into a patient, or in vivo for <^-l-^^^° ' 
particul« cell type or tissue. Target cells may be any 
30 mammalian cell (preferably a human cell,. J^-J^^''^ 
in in vivo use, the recombinant virus can target to any 
Tell t;^e normluy infected by adenovirus, depending up- 
the ro^e of administration, i.e., it can target, without 
limitation, neurons, hepatocytes, epithelial cells and 
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the like. The helper adenovirus sequences supply the 
sequences necessary to permit uptake of the virus by the 
AdA. 

Once the recombinant virus is taken up by a cell, 
5 the adenovirus flanked transg^ne is rescued from the 
parental adenovirus backbone by the machinery of the 
infected cell, as with other recombinant adenoviruses. 
Once uncoupled (rescued) from the genome of the AdA 
virus, the recombinant minigenie seeks an integration site 
10 in the host chromatin and becomes integrated therein, 

either transiently or stably, providing expression of the 
accompanying transgene in the host cell* 

V. Use of the AdA Viruses in Gene Therapy 

15 The novel recombinant viruses and viral conjugates 

of this invention provide efficient gene transfer 
vehicles for somatic gene therapy. These viruses are 
prepared to contain a therapeutic gene in place of the 
LacZ reporter transgene . illustrated in the exemplary 

20 viruses and vectors. By use of the AdA viruses 

containing therapeutic transgenes, these transgenes can 
be delivered to a patient in vivo or ex vivo to provide 
for integration of the desired gene into a target cell. 
Thus, these viruses can be employed to correct genetic 

25 deficiencies or defects. An example of the generation of 
an AdA gene transfer vehicle for the treatment of cystic 
fibrosis is described in Example 4 belov. One of skill 
in the art can generate any number of other gene transfer 
vehicles by including a selected transgene for the 

30 treatment of other disorders. 

The recombinant viruses of the present invention may 
be administered to a patient, preferably suspended in a 
biologically compatible solution or pharmaceutically 
acceptable delivery vehicle. A suitable vehicle includes 

35 sterile saline. Other aqueous and non^aqueous isotonic 
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sterile injection solutions and aqueous and non-aqueous 
sterile suspensions known to be pharmaceutically 
acceptable carriers and veil Icnown to those of sKill in 
the art Bay be employed for this purpose. 

The recombinant viruses ot this invention may be 
administered in sufficient amounts to transfect the 
desired cells and provide sufficient levels of 
•"egration and expression of the selected transgene to 
UvL a therapeutic benefit without -^-^^^^^^ 
L feet, or with medically acceptable P^^y^^^^?'^^^ 
effects Which can be determined by those skilled in the 
medical arts. Conventional and P*^*"^""*^^''^^ . 
acceptable parenteral routes of administration xnclude 
direct delivery to the target organ, tissue or site, 
intrlsai, intravenous, intramuscular, subcutaneous, 
intradermal and oral administration. Routes of 
administration may be combined, if 

Dosages of the recombinant virus will depend 
primarily'on factors such as the edition ^ing treated, 
the selected gene, the age, weight and health of the 
patient, and may thus vary among P^^^*"^- ^ 
therapeutically effective human dosage of ^ J^^^^^^ 
the present invention is believed to be in the range 
from about 20 to about 50 ml of saline solution 

the therapeutic benefit eg.lnet »y .id. .ff.ct.. Th. 
level, of expression ot th. «.l*ct^ ^ 
.onitored to detenan. th. .election, .d,ust«nt or 
frequency ot do.ege »iJmlnirtr.tlon. 
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The following examples Illustrate the construction 
of the pAdA shuttle vectors, helper viruses and 
recombinant AdA viruses of the present invention and the 
use thereof in gene therapy. These examples are 
5 illustrative only, and do not limit the scope of the 
present invention. 

Example 1 - Production of pAdA>CMVLacZ and pAdAc.CMVLftCZ 

10 A. pAdA.CMVLacZ 

A hviman adenovirus Ad5 sequence was modified to 
contain a deletion in the Ela region [map units 1 to 
9.2], which immediately follows the Ad 5» region (bp 1- 
360) (illustrated in Figs. lA) . Thus, the plasmid 

15 contains the 5* ITR sequence (bp 1-103) , the native 

packaging/ enhancer sequences and the TATA box for the Ela 
region (bp 104*360). A minigene containing the CMV 
ismediate early enhancer /promoter, an SD/SA seqpience, a 
cytoplasmic lacZ gene, and SV40 poly A (pA) , was 

20 introduced at the site of the Ela deletion. This 

construct was further modified so that the minigene is 
followed by the 3« ITR sequences (bp 35,353-end). The 
DNA sequences for these components are provided in Fig. 3 
and SEQ ID NO: 1 (see, also the brief description of this 

25 figure) . 

This construct was then cloned by conventional 
techniques into a pSP72 vector (Promega) backbone to make 
the circular shuttle vector pAdACMVLacZ. See the 
schematic of Pig. 2A. This construct was engineered with 

30 EcoRI sites flanking the 5* and 3« Ad5 ITR sequences. 
pAdA.CMVLacZ was then subjected to enzymatic digestion 
with EcoRI, releasing a linear fragment of the vector 
spanning the terminal end of the Ad 5 'ITR sequence 
through the terminal end of the 3»ITR sequence from the 

35 plasmid backbone. See Fig. 2B. 
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The shuttle vector pAdAc.CMVLacZ (Figs. 4A and 
5) was constructed using a pSP72 (Promega) backbone so 
that the Ad5 5' ITR and 3' ITR were positioned head-to- 
5 tail. The organization of the Ad5 ITRs was based on 
reports that suggest circular Ad genomes that ^^ve the 
t^inal ends fused together head-to-tail are infectious 
to levels comparable to linear Ad genomes. A minigene 
encoding the OCV enhancer, an SD/SA sequence, LacZ 
XO gene, and the poly A sequence was inserted 

following the 5- ITR. The DHA sequence of the resulting 
plasmid and the sequences for the individual components 
Le reported in Fig. 5 and SEQ ID HO: 2 (see also, brief 
description of Fig. 5). This plasmid does not ^equxre 
X5 enzymatic digestion prior to its use to P-duce the vxral 
particle (see Example 3). This vector was designed to 
enable restriction-independent production of LacZ Ad* 
vectors. 

The Ad.CBhpM. help«r vlr«. tK- 

«„1 r..n.t. . 12(5) =449-45. (19M)) i- • "PUcatlon 

ZlJnt n.n!;ir«. co„t.inl„, «. f 

.l„ia.». If co»tructio„ l»volv«l conventional olonxn, 

„ no^logou. r.co^in.tion fcnni^- ,^7^';™ 

™» BUb.tr.te w« extracted fro. cscl purified dl7001 
virions, .n Ads (eerotyp. subgroup c) variant that 
carries a 3 W> deletion between bu 7..4 tbrough 86 In the 
n:::s=entlal E3 region .provided by Or. "i"^";; ^ 

30 Washington university. St. I-uis, Hl.souri . Vl«l^ 
was prepared for co-transfectlon by digestion vitb C1.I 
(adenovirus genomic bp position 917. 

left arm of the g«.o~ .noo.*as.lng .a«.oviru. map units 
0-2.5. see lower diagram of Pig. »• 
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A parental cloning vector, pAd.Bglll was designed. 
It contains two segments of wild-type AdS genone (I.e., 
map units 0^1 and 9-16.1) separated by a unique Bglll 
cloning site for insertion of heterologous sequences. 
5 The missing AdS se<3uences betveen the two domains 

(adenovirus genome bp 361-3327) results in the deletion 
of Ela and the majority of Elb following recombination 
with viral DNA. 

A recombinant hpAP minlgene was designed and 
10 inserted into the Bglll site of pAd.Bglll to generate the 
complementing plasmid, pAdCBhpAP* The linear arrangement 
of this minigene includes: 

(a) the chicken cytoplasmic £-actin promoter 
[nucleotides 4-1 to -1-275 as described in T. A. Kost et al, 

15 NWlt AgldB Rggt> 11(23) :8287 (1983); nucleotides 9241- 
8684 of Fig. 7]; 

(b) an SV40 intron (e.g., nucleotides 1579-1711 of 
SEQ ID NO: 2) , 

(c) the sequence for human placental alkaline 
20 phosphatase (available from Genbank) and 

(d) an SV40 polyadenylation signal (a 237 Bam HI- 
Bell restriction fragment containing the cleavage/poly-A 
signals from both the early and late transcription units; 
e.g., nucleotides 837-639 of SEQ ID NO: 1). 

25 The resulting complementing plasmid, pAdCBhpAP 

contained a single copy of recombinant hpAP minigene 
flanked by adenovirus coordinates 0-1 on one side and 
9.2-16.1 on the other. 

Plasmid DNA was linearized using a unique Nhel site 

30 immediately 5* to adenovirus map unit zero (0) and the 
above-identified adenovirus substrate and the 
complementing plasmid DNAs were transfected to 293 cells 
[ATCC CRL1573] using a standard calcium phosphate 
transfection procedure [see, e.g., Sambrook et al, cited 

35 above]. The end result of homologous recombination 
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involving sequences that map to adenovirus map units 9- 
16.1 is hybrid Ad.CBhpAP helper virus ^ich contain, 
adenovirus nap units 0-1 and, in place of the Eia and Elb 
coding regions from the dl7001 adenovirus substrate, is 
the hpAP minigene from the plasmid, f 
sequences 9 to 100, with a deletion in the E3 (78.4-86 

mu) regions. 

i^e recombinant AdA virus of this invention are 
generated by co-transf ection of a shuttle vector with the 
helper virus in a selected packaging or non-packagxng 

cell line. ^ 

AS described in detail below, the linear fragment 
provided in Example lA, or the circular AdA genome 
carrying the LacZ of Example IB, is packaged intone 
Ad.CBhpAP helper virus (Example 2) using conventional 
techniques, which provides an empty capsid ^ead^s 
illustrated in Fig. 2C. Those virus particles which have 
successfully taken up the pAd shuttle genome «to the 
capsid head can be distinguished from those containing 
the hpAP gene by virtue of the differential expression of 

LacZ and hpAPe -i-i«/-icn 
in more detail, 293 cells (4 x 10^ pfu 293 cells/150 
imn dish) were seeded and infected with helper virus 
Ad.CBhpAP (produced as described in Example 2) at an MOl 
of 5 in 20 ml DMEM/2% fetal bovine serum (FBS) . This 
helper specific marker is critical for monitoring the 
level of helper virus contamination in AdA P^-P'-*^^^"^ 
Lfore and after purification. The helper vi-- provides 
in trans the necessary helper functions for synthesis and 
packaging of the AdACMVLacZ genome. 

TWO hours post infection, using either the 
restriction-dependent shuttle vector or the restriction- 
independent shuttle vector, plasmid pAdA.CMVLacZ 
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(digested with ScoRI) or pAdAc.CMVLacZ DNA, each carrying 
a LacZ minigene, was added to the cells by a calcium 
phosphate precipitate (2.5 ml calcium phosphate 
transf action cocktail containing 50 fig plasmid DNA) • 
5 Thirty to forty hours pob^-transf action, cells were 

harvested, suspended in 10 aM Tris-Cl (pH 8.0) (0.5 
ml/150 mm plate) and frozen at *-80^C. Frozen cell 
suspensions were subjected to three rounds of freeze 
(ethanol-dry ice) -thaw (27^C) cycles to release virion 
10 caps ids. Cell debris was removed by centrifugation 

(5,000xg for 10 minutes) and the clarified supernatant 
applied to a CsCl gradients to separate recombinant virus 
frcna helper virus as follows. 

Supernatants (10 ml) applied to the discontinuous 
15 CsCl gradient (composed of equal volumes of CsCl at 1.2 
g/ml, 1.36 g/ml, and 1.45 g/ml 10 iM Tris-Cl (pH 8.0)) 
were centrifuged for 8 hours at 72,128Xg, resulting in 
separation of infectious helper virus from incompletely 
formed virions. Fractions were collected from the 
20 interfacing zone between the helper and top components 
and analyzed by Southern blot hybridization or for the 
presence of LacZ transducing particles. For functional 
analysis, aliguots (2.0 ml from each sample) from the 
same fractions were added to monolayers of 293 cells (in 
25 35 mm wells) and expression of recombinant B«- 
galactosidase determined 24 hours later. More 
specifically, monolayers were harvecfted, suspended in 0.3 
ml 10 mM Tris-Cl (pH 8.0) buff er and an extract prepared 
by three rounds of freeze-rthaw cycles. Cell debris was 
30 removed by centrifugation and the supernatant tested for 
B-galactosidase (LacZ) activity according to the 
procedure described in J. Price et al, Proc. Natl. Acad. 
Sci. . USA , M: 156-160 (1987). The specific activity 
(milliunlts B-galactosidase/mg protein or reporter 
35 . • • 
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enzymes was measured from indicator cells. Por the 
recombinant virus, specific activity was 116. 

Fractions with B-galactosidase activity from the 
discontinuous gradient were sedimented through an 
ccuilibrium cesium gradient to further enrich the 
preparation for Ad^ virus. A linear gradient was 
generated in the area of the recombinant virus spanning 
densities 1.29 to 1.34gm/ml. A sharp peak of the 
recombinant virus, detected as the appearance of the 6- 
gal activity in infected 293 cells, eluted between 1.31 
and 1.33 gm/dl. This peaX of recombinant virus was 
located between two major A^eo absorbing peaks and in 
an area of the gradient with the helper virus was 
precipitously dropping off. The equilibrium 
sedimentation gradient accomplished another 102 to 103 
fold purification of recombinant virus from helper vxrus^ 

I.- hha rMviAcZ virus recovered from 
The yield of recombinant AdA.CMVLacZ viru 

a 50 plate prep after 2 sedimentations ranged from 107 to 
108 transducing particles. 

Analysis of lysates of cells transfected with the 
recombinant vector and infected with helper "vealed 
virions capable of transducing the recombinant ^r.i<,-ne 
contained within the vector. Subjecting aU^o^s of the 
fractions to Southern analysis using probes specific to 
^:^t::mbinant virus or helper virus revealed packaging 
of multiple molecular forms of vector derived Bequence 
The predominant form of the deleted viral genome was the 
size (-5.5 kb) of the corresponding double stranded DHA 
.onomer (Ad..CKVLacZ) with less abundant ^^^^^^^^^ 
higher molecular weight species ("10 kb and 15 kb) also 
present. Full-length helper virus is 35kb. I»P-^-^^^ 
L peak of vector transduction activity corresponds with 
the highest molecular weight form of the deleted virus. 
These results confirm the hypothesis that ITRs and 
contiguous packaging sequence are the only elements 



W0 9S/13597 



PCT/US95/14017 



37 

necessary for incorporation into virions • An apparently 
ordered or preferred rearrangement of the recombinant Ad 
monomer genome leads to a more biologically active 
molecule. The fact that larger molecular species of the 
5 deleted genome are 2x and 3x i old larger than the monomer 
deleted virus genome suggests that the rearrangements may 
involve secpaential duplication of the original genome. 

These same procedures may be adapted for production 
of a recombinant AdA virus using a crippled helper virus 
10 or helper virus conjugate as described previously. 

Example 4 - Recombinant AdA Virus Containing a 

Therapeutic Minioene 

To test the versatility of the recombinant AdA virus 
15 system, the reporter LacZ minigene obtained from 

pAdACMVLacZ vas cassette replaced with a therapeutic 

minigene encoding CFTR. 

The minigene contained human CFTR cDNA [Riordan et 

al. Science . ilS:1066-1073 (1989); nucleotides 8622-4065 
20 of SEQ ID NO: 3] under the transcriptional control of a 

chimeric CMV enhancer/chicken B-actin promoter element 

(nucleotides «fl to +275 as described in T. A. Xost et al, 

Nucl, Acids Res.- 11(23) :8287 (1983); nucleotides 9241- 

8684 Of SEQ ID NO: 3, Fig. 7); and followed by an SV-40 
25 pply-A sequence (nucleotides 3887-3684 of SEQ ID NO: 3, 

Pig. 7). 

The CFTR minigene was inserted into the El deletion 
site of an Ad5 virus (called pAd.ElA) which contains a 
deletion in Ela from mu 1-9.2 and a deletion in E3 from 
30 mu 78.4-86. 

The resulting shuttle vector called pAdA.CBCFTR (see 
Figs. 6 and the DNA sequence of Fig. 7 [SEQ ID NO: 3]) 
used the same Ad ITRs of pAdACKVLacZ, but the Ad5 
sequenceis terminated with Nhel sites instead of EcoRI. 
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Therefore releass of the minigene f roB the plasmid was 
accomplished by digestion with Nhel. 

The vector production system described in Example 3 
was employed, using the helper virus Ad.CBhpAP (Example 
2) . Monolayers of 293 cells grown to 80-90% conf luency 
in 150 mm culture dishes were infected with the helper 
virus at an MOI of 5. Infections were done an DMEM 
supplemented with 2% FBS at 20 ml media/150 mm plate. 
TWO hours post-infection, 50 ,g plasmid DNA in 2.5 ml 
transfection cocktail was added to each plate and evenly 
distributed. 

Delivery of the pAdA.CBCFTR plasmid to 293 cells was 
mediated by formation of a calcium phosphate precipitate 
and AdA.CBCPPR virus resolved from Ad.CBhpAP helper virus 
by CsCl buoyant density ultracentrifugation as follows: 

cells were left in this condition for 10-14 h, 
af terwhich the inf ection/transf ection media was replaced 
with 20 ml fresh DMlM/2% FBS. Approximately 30 h post- 
transfection, cells were harvested, suspended in 10 m 
Tris-Cl (pH 8.0) buffer (0.5 ml/150 mm plate), and stored 
at -80«C. 

Frozen cell suspensions were lysed by three 
sequential rounds of freeze (ethanol-dry 
(37-C). cell debris was removed by centrifugation (5,000 
X g for 10 min) and 10 ml clarified extract layered on^ 
a CsCl step gradient composed of three 9.0 ml txers with 
densities 1.45 g/ml, 1.36 g/ml, and 1.20 g/ml ^sCl in 10 
^ Tris-Cl (PH 8.0) buffer. Centrifugatxon was Performed 
at 20,000 rpm in a Beckman SW-28 rotor for 8 h at 4 C. 
Fractions (1.0 ml) were collected from the bottom of the 
centrifuge tube and analyzed for rAAd transducing 
vectors. Peak fractions were combined and banded to 
ecriilibrium. Fractions containing transducing virions 
wL dialyzed against 20 HEPE8 (pH 7.8)/150 mM NaCl 
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(HBS) and stored frozen at -80«C in the presence of 10% 
glycerol or as a liquid stock at *20*C (HBS+40% 
glycerol) • 

Fractions collected after ultracentrifugation were 
5 analyzed for transgene eafpresbion and vector DNA. For 
lacZ ArAd vectors, 2 Ml aliquots were added to 293 cell 
monolayers seeded in 35 vm culture wells. Twenty-foxu: 
hours later cells were harvested, suspended in 0.3 ml 10 
mM Tris-Cl (pH 8.0) buffer, and lysed by three rounds of 

10 freeze-thav. Cell debris was removed by centrifugation 
(15,000 X g for 10 min) and assayed for total protein 
[Bradford, (1976)] and p*galactosidase activity [Sambrook 
et al, (I9a9)] using 0NP6 (o-Nitrophenyl p-D-* 
galactopyranoside) as substrate. 

15 Expression of CFTR protein from the AdA.CBCFTR 

vector was determined by immunofluorescence localization. 
Aliquots of AdA.CBCPTR, enriched by two-rounds of 
ultracentrifugation and exchanged to HBS storage buffer, 
were added to primary cultures of airway epithelial cells 

20 obtained from the lungs of CF transplant recipients. 
Twenty-four hours after the addition of vector, cells 
were harvested and affixed to glass slides using 
centrifugal force (Cytospin 3, Shandon Scientific 
Limited) . Cells were fixed with freshly prepared 3% 

25 paraformaldehyde in PBS (1.4 mM KH2PO4, 4.3 lAM Na2HP04, 
2.7 mM KCl, and 137 mH NaCl) for 15 min at room 
temperature (RT) , washed twice in PBS, and permeabilized 
with 0.05% NP-40 for 10 min at RT. The 
immunofluorescence procedure began with a blocking step 

30 in 10% goat serxim (PBS/GS) for 1 h at RT, followed by 

binding of the primary monoclonal mouse anti-human CFTR 
(R-domain specific) antibody (Genzyme) diluted 1:500 in 
PBS/GS for 2 h at RT. Cells were washed extensively in 
PBS/GS and incubated for 1 h at RT with a donkey anti- 

35 mouse IgG (H+L) FITC conjugated 
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antibody (Jackson immunoResearch Laboratories) diluted 

1:100 in PBS/GS. 

For southern analysis of vector DHX, 5 Ml aliquot, 
were taken directly f ron CsCl fractions and incubated 
with 20 Ml capsid digestion butfer (50 inM Tris-Cl, pH 
8.0; l.O inH BDTA, pH 8.0; 0.5% SDS, and 1.0 i«g/«l 
proteinase K) at 50-C for 1 h. The reactions were 
allowed to cool to RT, loading dye was added, and 
electrophoresed through a 1.2% agarose gel. Resolved 
DNAi were electroblotted onto a nylon membrane (Hybond-N) 
and hybridized with a 32-P labeled restriction fragment. 
Blots were analyzed by autoradiography or scanned on a 
Phosphorimager 445 SI (Molecular Dynamics) . 

The results that were obtained from Southern blot 
analysis of gradient fractions revealed a distinct viral 
Z^IZ^ Bi^^ated faster than the helper Ad.CBhpAP DKA. 
The highest viral titers mapped to fractions 3 and 4. 
Quantitation of the bands in fraction 4 indicated the 
titer of Ad.CBhpAP was approximately l.5x greater than 
AdACBCFTR. However, if the size difference between the 
two viruses is factored in (Ad.CBhpAP-35 kb; 
AdACBCFrR»6.2 kb) , the viral titer (where 1 ^f^J^^^ 
DNA molecule) of AdACB.CFTR is at least 4-fold greater 
than the viral titer of Ad.CBhpAP. 

While southern blot analysis of gradient fractions 
was useful for showing the production of AdA viral 
particles, it also demonstrated the utility of 
ultracentrifugation for purifying AdA viruses, 
considering the latter of these, both LacZ and CFTR 
transducing viruses banded in CsCl to an intermediate 
density between infectious adenovirus helper virions 
(1.34 g/ml) and incompletely formed capsida (1.31 g/ml). 
The lighter density relative to helper virus likely 
results from the smaller genome carried by the AdA 
viruses. This further suggests changes in virus size 



wo 96/13597 PCTAJS9S/14017 



41 

Influences the density and purification of AdA virus. 
Regardless, the ability to separate AdA virus from the 
helper virus is an important observation and suggests 
further purification may be achieved by successive rounds 
5 of banding through CsCl. 

This recombinant virus is useful in gene therapy 
alone, or preferably, in the form of a conjugate prepared 
as described herein. 

10 EXftffiPlg 5 - Correction of Genetic Defect in CF airway 
Epithelial Cells with AdACB.cPTR 

Treatment of cystic fibrosis, utilizing the 
recombinant virus provided above, is particularly suited 
for in vivo, lung-directed, gene therapy. Airway 

15 epithelial cells are the most desirable targets for gene 
transfer because the pulmonary complications of CF are 
usually its most morbid and life-limiting. 

The recombinant AdACB.CFTR virus was fractionated on 
sequential CsCl gradients and fractions containing CFTR 

20 sequences, migrating between the adenovirus and top 

components fractions described above were used to infect 
primary cultures of human airvay epithelial cells derived 
from the lungs of a CF patient. The cultures were 
subsequently analyzed for expression of CFTR protein by 

25 immxuiocytochemistry. Immunofluorescent detection with 
mouse anti-human CFTR (R domain specific) antibody was 
performed 24 hours after the addition of the recombinant 
virus. Analysis of mock infected CF cells failed to 
reveal significant binding to the R domain specific CFTR 

30 antibody. Primary airway epithelium cultures exposed to 
the recombinant virus demonstrated high levels of CFTR 
protein in 10-20% of the cells. 

Thus, the recombinant virus of the invention, 
containing the CFTR gene, may be delivered directly into 

35 the airway, e.g. by a formulating the virus above, into a 
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preparation which can be inhaled. For example, the 
recombinant virus or conjugate of the invention 
containing the CFTR gene, is suspended 0.25 molar 
sodium Chloride. The virus or conjugate is taXen up by 
5 respiratory airway cells and ti.e gene is expressed. 

Alternatively, the virus or conjugates of the 
invention may be delivered by other suitable »««ns, 
including site-directed injection of the virus bearing 
the CFTR gene. In the case of CFTR gene delivery, 
10 preferred solutions for bronchial instillation are 

sterile saline solutions containing in the range of from 
about 1 X 10' to 1 X 10^0 .ore 1^-^^^^^^^^' ^ 

the range of from about 1 x 10* to 1 x 10« pfu/ml of the 
virus of the present invention. 
15 other suitable methods for the treatment of cystic 

fibrosis by use of gene therapy recombinant viruses of 
this invention may be obtained from the art di-^-^^"^ 
of other types of gene therapy vectors for CF. See, for 
example, U. S. Patent Ho. 5,240,846, incorporated by 
20 reference herein. 

Another version of the helper virus of this 
25 invention is a polylysine conjugate which enables the 

pAdA shuttle plasmid to complex directly with the helper 
virus capsid. This conjugate permits efficient delivery 
of shuttle plasmid pAdA shuttle vector in tandem with the 
helper virus, thereby removing the need for a separate 
30 transfection step. See, Fig. 10 for a diagrammatic 
outline of this construction. Alternatively, such a 
conjugate with a plasmid supplying some Ad genes and the 
helper supplying the remaining necessary genes for 
production of the AdA viral vector provide, a novel way 
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to reduce contamination of the helper virus, as discussed 
above. 

Purified stocks of a large-scale expansion of 
Ad.CBhpAP were modified by coupling poly-L- lysine to the 
5 virion capsid essentially as dtiscribed by K. J. Fisher 
and J. M. Wilson, Btochem, J,. 2aa:49-58 (1994), 
resulting in an Ad.CBhpAP-(Lys),j conjugate. The 
procedure involves three steps. 

First, CsCl band purified helper virus Ad.CBhpAP was 

10 reacted with the heterobifunctional crosslinker sulfo- 
SMCC [sulf o- (N-succinimidyl-4- (N-maleimidomethyl) 
cyclohexane-l-carboxylate] (Pierce) • The conjugation 
reaction, which contained 0.5 mg (375 nmol) of sulpho- 
SMCC and 6 X 10^^ A2eo helper virus particles in 3.0 nl of 

15 HBS, was incubated at 30^C for 45 minutes with constant 
gentle shaking. This step involved formation of a 
peptide bond between the active N-hydroxysuccinimide 
(NHS) ester of sulpho-SHCC and a free amine (e.g. lysine) 
contributed by an adenovirus protein sequence (capsid 

20 protein) in the vector, yielding a maleimide-activated 
viral particle. The activated adenovirus is shown in 
Fig. 10 having the capsid protein fiber labeled with the 
nucleophilic maleimide moiety. In practice, other capsid 
polypeptides including hexon and penton base are also 

25 targeted. 

Unincorporated, unreacted cross-linker was removed 
by gel filtration on a 1 cm x 15 cm Bio-Gel P-6DG (Bio- 
KBd Laboratories) column equilibrated with 50 mN Tris/HCl 
buffer, pH 7.0, and 150 mN NaCl. Peak JI260 fractions 

30 containing maleimide-activated helper virus were combined 
and placed on ice. 

Second, poly-L- lysine having a molecular mass of 58 
kDa at 10 mg/ml in 50 mN triethanolamine buffer (pH 8.0), 
150 uM NaCl and 1 mN EDTA was thiolated with 2- 

35 imminothiolane/HCl (Traut's Reagent; Pierce) to a molar 
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ratio of 2 Boles -SH/mole polylysine under N^; the cyclic 
thioimidate reacts with the poly (L-lysine) primary amines 
resulting in a thiolated polycation. After a 45 minute 
incubation at room temperature the reaction was applied 
5 to a 1 cm X 15 cm Bio-Gel P6DG column equilibrated with 
50 mM Tris/HCl buffer (pH 7.0), 150 mM Had and 2 «M EDTA 
to remove unincorporated Traufs Reagent. 

Quantification of free thiol groups was accomplished 
with Ellman'B reagent [5.5.-dithio-bis-(2-nitroben20xc 
10 acid)), revealing approximately 3-4 mol of -^H/mol of 
poly(L-lysine). The coupling reaction was initiated by 
adding 1 x 10« A^^o particles of maleimide-activated 
helper virus/mg of thiolated poly (L-lysine) and 
incubating the mixture on ice at 4-C for 15 hours under 
15 argon. 2-mercaptoethylamine was added at the completion 
of the reaction and incubation carried out at room 
temperature for 20 minutes to block unreacted maleimxde 

^^^^'virus-polylysine conjugates, Ad.CPAP-p(Lys)„, were 
20 purified away from unconjugated poly (L-lysine) by 

ultracentrifugation through a CsCl step gradient with an 
initial composition of equal volumes of 1.45 g/ml (bottom 
step) and 1.2 g/ml (top step) CsCl in 10 mM Tris/HCl 
buffer (pH 8.0). centrifugation was at 90,000 g f or 2 
25 hours at 5-C. The final product was dialyzed against 20 
tfM Hepes buffer (pH 7.8) containing 150 mM NaCl (HBS) . 

r 1r 7 rnrmntioTi of ^^ '^ ^^' ^ ' ^' '-^'ZJ]^^^^^^^ 

The formation of Ad.CBhpAP-pLys/pAdA.CMVLacZ 
30 particle is initiated by adding 20 pg plasmid 

pAdA.CMVLacZ DNAs to 1.2 X lo" A,eo particles Ad.CBhpAP- 
pLys in a final volume of 0.2 ml DMEM and allowing the 
complex to develop at room temperature for between 10-15 
minutes. This ratio typically represents the plasmxd DNA 
35 binding capacity of a standard lot of adenovirus-pLys 
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conjugate and gives the highest levels of plasmid 
transgene expression. 

The resulting trans- infection particle is 
transfected onto 293 cells (4 x lo'^ cells seeded on a 150 
5 msL dish) . Thirty hours after transfcfction, the particles 
are recovered and subjected to a freeze/thaw technique to 
obtain an extract. The extract is purified on a CsCl 
step gradient with gradients at 1,20 g/ml, 1.36 g/ml and 
1.45 g/nl. After centrifugation at 90,000 x g for 6 
10 hoiirs, the AdA vectors were obtained from a fraction 

under the top components as identified by the presence of 
LacZ, and the helper virus was obtained from a smaller, 
denser fraction, as identified by the presence of hpAP. 

15 Eacampl? 9 - Cgngtrwgtipn 9t MPflif ipd Helpgr yinm with 
Crippled PflcKftqinq fPAC) ff?qwgng»g 

This example refers to Figs. 9 A through 9C, lOA and 
lOB in the design of modified helper viruses of this 
invention. 

20 Ad5 5* terminal sequences that contained PAC domains 

I and II (Fig. 8A) or PAC domains I, 11^ III, and IV 
(Fig. 8B) were generated by PGR from the wild type Ad5 5' 
genome depicted in Fig. IB using PGR clones indicated by 
the arrows in Fig. IB. The resulting amplification 

25 products (Fig. 8A and 8B) sequences differed from the 

wild-type Ad5 genome in the number of A-repeats carried 
by the left (5*) end. 

As depicted in Fig. 8C,^e6e amplification products 
were subcloned into the multiple cloning site of 

30 pAd.Llnk.l (IHGT Vector Core). pAd.Link.l is a 

adenovirus based plasmid containing adenovirus m.u. 9.6 
through 16.1. The insertion of the modified PAC regions 
into pAd.Link.l generated two vectors pAd.PACII 
(containing PAC domains I and II) and pAd.PACIV 

35 (containing PAC domains I, II, III, and IV). 
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20 



25 



30 



35 



Thereafter, as depicted in Figs. lOA and lOB, for 
each of these plasmids, a human placenta alkaline 
phosphatase reporter minigene containing the iimediate 
early CMV enhancer/promoter (CMV) , human placenta 
alkaline phosphatase cDNA (hpAV), and SV40 
polyadenylation signal (pA) , was subcloned into each PAC 
vector, generating pAd.PAClI.CMVhpAP and 
pAd.PAClV.CMVhpAP, respectively. 

These plasmids were then used as substrates for 
homologous recombination with dl7001 virus, described 
above, by co-transfection into 293 cells. Homologous 
recombination occurred between the adenovirus map units 
9-16 of the plasmid and the crippled Ad5 virus. The 
results of homologous recombination were helper viruses 
containing Ad5 5- terminal sequences that contained PAC 
domains 1 and II or PAC domains 1, II, and IV, 

followed by the minigene, and Ad5 3' sequences 9.6-78.3 
and 87-100. Thus, these crippled viruses are deleted of 
the El gene and the E3 gene. 

The plaque formation characteristics of the PAC 
helper viruses gave an immediate indication that the PAC 
modifications diminished the rate and extent of growth, 
specifically, PAC helper virus plaques did not develop 
until day 14-21 post-transfection, and on maturation 
remained small. From previous experience, a standard 
first generation Ad.CBhpAP helper virus with a complete 
left terminal sequence would begin to develop by day 7 
and mature by day 10. 

Viral plaques were picked and suspended in 0.5 ml or 
DMEM media. A small aliquot of the virus stock was used 
to infect a fresh monolayer of 293 cells and 
histochemically stained for recombinant alkaline 
phosphatase activity 24 hours post-infection. Six of 
eight Ad.PACIV.CMVhpAP (encodes A-repeats I-IV) clones 
that were screened for transgene expression were 



wo 96/13597 



PCTAJS95/14017 



47 

positive, while all three Ad.PACII.CMVhpAP clones that 
were selected scored positive. The clones have been 
taken through two rounds of plague purification and are 
currently being expanded to generate a working stock. 
5 These crippled helper vix uses are useful in the 

production of the AdA virus particles according to the 
procedures described in Example 3. They are 
characterized by containing sufficient adenovirus genes 
to permit the packaging of the shuttle vector genome, but 

10 their crippled PAC sequences reduce their efficiency for 
self -encapsidat ion. Thus less helper viruses are 
produced in favor of more AdA recombinant viruses. 
Purification of AdA virus particles from helper viruses 
is facilitated in the CsCl gradient, which is based on 

15 the weight of the respective viral particles. This 

facility in purification is a decided advantage of the 
AdA vectors of this invention in contrast to adenovirus 
vectors having only El or smaller deletions. The AdA 
vectors even with minlgenes of up to about 15 kb are 

20 significantly different in weight than wild type or other 
adenovirus helpers containing many adenovirus genes. 

Example 9 - AdA Vector Containing a full-lenoth 
dystrophin transaene 

25 Duchenne muscular dystrophy (DHD) is a comsaon x- 

linked genetic disease caused by the absence of 
dystrophin, a 427K protein encoded by a 14 kilobase 
transcript. Lack of this important sarcolemmal protein 
leads to progressive muscle wasting, weakness, and death. 

30 One current approach for treating this lethal disease is 
to transfer a functional copy of the dystrophin gene into 
the affected muscles. For skeletal muscle, a 
replication-defective adenovirus represents an efficient 
delivery system. 
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According to the present invention, a recombinant 
plasmid pAdA.CMVmdys was created which contains only the 
Ad5 cis-elements (i.e.. ITRs and contiguous packaging 
sequences) and harbors the full-length murine dystrophin 
gene driven by the CHV promote.-. This plasmid was 
generated as follows. 

PSL1180 [Pharmacia Biotech] was cut with Not I, 
filled in by Klenow, and religated thus ablating the Not 
I site in the plasmid. The resulting plasmid is termed 
PSL1180NN and carries a bacterial ori and Map resistance 
gene. 

pAdA.CMVLacZ of Example 1 was cut with EcoRI, 
klenowed, and ligated with the Apal-cut pSLll80NH to form 

pAdA . CKVLacZ ( Apal ) . 

15 The 14 kb mouse dystrophin cDNA [sequences 

provided in C. C. Lee et al, mts^. 241:334-336 (1991)] 
vas cloned in two large fragments using a lambda ZAP 
cloning vector (Stratagene) and subsequently cloned into 
the bluescript vector pSK- giving rise to the plasmid 

20 pCCL-DMD. A schematic diagram of this vector is provided 
in Fig. 11, which illustrates the restriction enzyme 
sites. 

pAdA.CMVI^cZ (Apal) was cut with WotI and the 
large fragment gel isolated away from the lacZ cDNA. 
25 pCCL-DMD was also cut with NotI, gel isolated and 

BUbseqently ligated to the large NotI fragment of NotI 
digested pAdA.CMVLacZ (Apal) . The sequences of resulting 
vector, PAdA.CMVmdys, are provided in Fig. 12A-12P [SEQ 
ID N0:10]. 

This plasmid contains sequences form the lett 
end of the AdS encompassing bp 1-360 (5- ITR) , a mouse 
dystrophin minigene under the control of the CMV 
promoter, and sequence from the right end of Ad5 spanning 
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bp 35353 to the end of the genome (3* ITR) • The mlnlgene 
is followed by an SV*40 poly-^A sequence similar to that 
described for the plasmids described above. 

The vector production system described herein is . 
5 employed. Ten 150mm 293 platvS are infected at about 90% 
confluency with a reporter recombinant El-deleted virus 
Ad.CBhpAP at an MOI of 5 for 60 minutes at 37 ^C. These 
cells are transfected with pAdA.CMVmDys by calcium 
phosphate co-precipitation using 50 linearized 

10 DNA/dish for about 12-16 hours at 37<'C. Media is 
replaced with DMEM -f 10% fetal bovine serum. 

Full cytopathic effect is observed and a cell lysate 
is made by subjecting the cell pellet to freeze-thaw 
procedures three times. The cells are subjected to an 

15 SW41 three tier CsCl gradient for 2 hours and a band 
migrating between the helper adenovirus and incomplete 
virus is detected. 

Fractions are assayed on a 6 well plate containing 
293 cells infected with 5X of fraction for 16-20 hours in 

20 DMEM + 2% FBS. Cells are collected, washed with 

phosphate buffered saline, and resuspended in 2 ml PBS. 
200X of the 2ml cell fractions is cytospun onto a slide. 

The cells were subjected to immunofluorescence for 
dystrophin as follows. Cells were fixed in ION MeOH at 

25 -20*C. The cells were exposed to a monoclonal antibody 
specific for the carboxy terminus of human dystrophin 
tNCL-DyS2; Novocastra Laboratories Ltd., UK]. Cells were 
then washed three times and exposed to a secondary 
antibody, i.e. 1:200 goat anti-mouse IgG in FITC. 

30 The titer/fraction for seven fractions revealed in 

the immunof luorescent stains were calculated by the 
following formula and reported in Table 2 below. 
DFU/field = (DFU/200X cells) x 10 = DFU/10* cells = 
(DFU/5X viral fraction) x 20 = DFU/IOOX f^raction. 
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10 



1 
2 
3 
4 
5 
6 



Table 2 

pPIT/lOOX 



6 X iC 
1.8 X 10* 
9.6 X 10' 
200 



15 ^ 200 

A virus capable of transducing the dystrophin 
Binigene is detected as a "positive- (i'^' f 

20 fluorescent) cell. The results of the IF illustrate that 
heat-treated fractions do not show positive 
i^unofluorescence. Southern blot data suggest one 
species on the same size as the input DNA, with helper 
virus contamination. 

25 The recombinant virus can be subsequently separated 

from the majority of helper virus by --"^i^-f 
through cesium gradients. Initial studxes demonstrate 
that the functional AdCMVAmDys virions are produced, but 
are contaminated with helper virus. Successful 
30 purification would render AdA virions that are incapable 
rlncoding viral proteins but are capable of transducing 
murine skeletal muscle. 

35 The following experiment provides a method for 

preparing a recombinant AdA according to the invention, 
^tulzing helper viruses from serotypes which differ from 
Tat Of L pAdA in the transfection/ infection protocol 
It is unexpected that the ITRs and packaging sequence of 
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Ad5 could be incorporated into a virion of another 
serotype. 

A. Protocol 

The basic approach is to transfect the 
5 AdA.CMVlac^ recombinant virus (Ad5) into 293 cells and 
subsequently infect the cell with the helper virus 
derived from a variety of Ad serotypes (2, 3, 4, 5, 1$ 8, 
12^ and 40). When CPE is achieved, the lysate is 
harvested and banded through two cesium gradients. 

10 More particularly, the Ad5-based plasmid 

pAdA.CKViacZ of Example 1 was linearized with EcoRI. The 
linearized plasmids were then transfected into ten 150 mm 
dishes of 293 cells using calcium phosphate co- 
precipitation. At 10-15 hours post transfection, wild 

15 type adenoviruses (of one of the following serotypes: 2, 
3, 4, 5, 7, 12, 40) were used to infect cells at an MOI 
of 5. The cells were then harvested at full CPE and 
lysed by three roxinds of freeze-thawing. Pellet is 
resuspended in 4 mL Tris-HCl. Cell debris was removed by 

20 centrifugation and partial purification of Ad5A. CMVIacZ 
from helper virus was achieved with 2 rounds of CsCl 
gradient centrifugation (SW41 column, 35,000 rpm, 2 
hours) . Fractions were collected from the bottom of the 
tube (fraction #1) and analysed for lac^ transducing 

25 viruses on 293 target cells by histochemical staining (at 
2 Oh PI). Contaminating helper viruses were quant itated 
by plaque assay. 

Except for adenovirus type 3, infection with Ad 
serotypes 2, 4, 5, 7, 12 and 40 were able to produce lacZ 

3.0 transducing viruses. The peak of p-galactosidase 
activity was detected between the two major A250 
absorbing peaks, where most of the helper viruses banded 
(data not shown) . The quantity of lacZ virus recovered 
from 10 plates ranged from 10* to 10® transducing 

35 particles depending on the serotype of the helper. As 
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expected Ad2 and Ad5 produced the highest titer of lacZ 
transducing viruses (Table 3). Wild type contamination 
was in general lO^-iO^ log higher than corresponding lac^ 
titer except in the case of Ad40. 
5 B. Pesults 

Table 3 suninarizes the growth characteristics 
of the wild type adenoviruses as evaluated on propagation 
in 293 cells. This demonstrated^ the feasibility of 
utilizing these helper viruses to infect the cell line 
10 which has been transfected with the Ad5 deleted virus. 

Table 3 

Adenovirus serotypes P/ml Pfu/»1 P^P«« 

15 2 5 X lO" 2.5 X lO" 20:01 

1 X lO" 6.25 X 10^ 160:1 

3 X lo" 2 X 10* 150: 1 

1 X lO" 5 X lO" 20:01 

5 X lO" I X lO" 50: 1 



3 

20 * 

5 



35 



7a 



25 12 6 X lO^l 4 X 10« 150:1 

1.2 X lO" 

2.2 X 10" 4.4 X 108 5000:l 



35 1.2 X lO" 



30 40 



40 



Table 4 summarizes the results of the final 
purified fractions. The middle column, labeled LFO/|il 
quantifies the production of lacZ forming units, which x« 
a direct measure of the packaging and propagation of 
pseudotyped recombinant AdA virus. The pfu/Ml ^^^^^^^ 
an estimate of the contaminating wild type virus. AdA 
virus pseudotyped with all adenoviral strains was 
generated except for Ad3. The titers range between 10 - 
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Table 4 
Serotypes LFU/ml 
2 4.6 X 10^ 



PFU/ml 
1.8 X lO' 



10 



IS 



20 



3 
4 
5 
7a 
12 
40 



6.7 X 10* 



6.3 X 10^ 



3 X 10* 
1.2 X 10- 



9.5 X 10^ 



NA 

9.3 X 10^ 
1.9 X 10* 



1.8 X 10 
3.3 X 10* 



1.5 X 10^ 



,8 



25 



Table 5A-5D represents a more detailed analysis 
of the fractions from the second purification for each of 
the experiments summarized in Table 4. Again, UTJ/mI is 
the recovery of the AdA viruses, whereas pfu/Ml 
represents recovery of the helper virus. 



30 



35 



40 



45 



Ad2 Fraction # 



Table 5A 
VOLUNE/Ul LFU/Ul 



PFU/Ul 



1 


120 


9532 


8 


X 10* 


2 


100 


5.8 X 10* 


3 


X 10* 


3 


100 


8.24 X 10* 


6 


X 10^ 


4 


100 


9.47 X 10* 


1.2 


X 10* 


5 


100 


6 X 10* 


8 


X 10* 


6 


100 


2 X 10* 


6 


X 10* 


7 


100 


5434 


5 


X 10* 



Total/ 10 pH 



3.32 X 10^ 



1.35 X 10^ 
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Table 5B 



10 



15 



20 



A&4 Fraction # 
1 
2 
3 
4 
5 
6 

Total/ 10 pH 



VOLUME/Ul 

100 

100 

100 

100 

100 

100 



LFU/Ul 
1000 
1.79 X 10* 
1.8 X 10* 
2909 
920 
153 
4 X 10® 



PFU/ul 
1.75 X 10* 
2.8 X 10* 

5.5 X 10* 
1.25 X 10* 

4 X 10* 
3 X 10^ 

5.6 X 10^ 



25 



30 



35 



40 



XdS Fraction # 
1 
2 
3 
4 
5 
6 

Total/ 10 pH 



120 


1.98 X 10* 


6 


X 


10® 


100 


5.8 X 10* 


3 


X 


10« 


100 


1.2 X 10* 


1.5 


X 


10* 


100 


1 X 10* 


1.4 


X 


10* 


100 


7.96 X 10* 


8 


X 


10* 


100 


6860 


6 


X 


10* 




3.88 X lO** 


1.2 


X 


109 
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Ad7 Fraction # 
1 

■ 2 
3 . 
4 

• 5 
6 
7 

Total/10 pH 
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Table 5C 



VOLUME/Ul 
100 
100 
100 
100 
100 
100 
100 



LFU/Ul 

1225 

5550 

4938 

3866 

4134 

995 

230 
2.09 X 10^ 



PPU/Ul 
5 X 10- 



4 X 10' 



2 X 10- 



8 X 10^ 



6 X 10* 



7 X 10^ 
6 X 10^ 



1.3 X 10* 



25 



30 



35 



Adi2 Fraction # 

' 1 
2 
3 
4 

5- ' . 
Total/ 10 pH 



100 


31 


5 


X 


10* 


80 


169 


8.5 


X 


10* 


80 


245 


1.8 


X 


10* 


110 


161 


1>1 


X 


10* 


120 


62 


7 


X 


103 




6.14 X 10* 


1.65 


X 


io8 
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Table 5D 





Aa40 Fraction # 


VOLIME/Ul 


LFU/Ul 


PFU/Ul 


e 
9 


1 


80 


61 


5 




2 


80 


184 


3 


10 


3 


80 


199 


3 




4 


80 


168 


1 




5 


80 


122 




15 


6 


100 


46 






7 


100 


32 




20 


Total/ 10 pH 




6.65 X 10* 


1.1 X 10^ 







25 y^;nises 

Aliquots of serial fractions were analysed by 
southern blots using lac^ as a probe. In the case of Ad2 
and 5, not only the linearized monoiaer was packaged but 
multiple forms of recombinant virus with distinct sizes 

30 were found. These forms correlated well with the sizes 
of dimers, trimers and other higher molecular weight 
concatamers. The linearized monomers peaked closer to 
the top of tube (the defective adenovirus band) than 
other forms. When these forms were correlated with lacZ 

35 activity, a better correlation was found between the 
higher molecular weight forms than the monomers. With 
pseudotyping of Ad4 and Ad7, no linearized monomers were 
packaged and only higher molecular weight forms were 
found. 

40 These data definitively demonstrate the 

production and characterization of the A virus and the 
different pseudotypes. This example illustrates a very 
simple way of generating pseudotype viruses. 
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Example 11 - AdA Vector Containing a FH Gene 

Familial hypercholesterolemia (FH) is an autosomal 
dominant disorder caused by abnormalities (deficiencies) 
in the function or expression of LDL receptors [H«S. 
5 Brown and J. L. Goldstein, Sclfe.ice . 232 f4746^ :34«>37 
(1986); J.L. Goldstein and n.S. Brown, '^Familial 
hypercholesterolemia" in Metabolic Basis of Inherited 
Disease. . ed. C.R. Scriver et al, McGraw Hill, New York, 
ppl215-1250 (1989) •) Patients who inherit one abnormal 

10 allele have moderate elevations in plasma LDL and suffer 
premature life-threatening coronary artery disease (CAD) . 
Homozygous patients have severe hypercholesterolemia and 
life-threatening CAD in childhood. An FH-containing 
vector of the invention is constructed by replacing the 

15 iacZ minigene In the pAdAcCMVlacZ vector with a ainigene 
containing the LDL receptor gene [T, Yamamoto et al. 
Cell , l£: 27-38 (1984)] using known techniques and as 
described analogously for the dystrophin gene and CFTR in 
the preceding examples. Vectors bearing the LDL receptor . 

20 gene can be readily constructed according to this 

invention* The resulting plasmid is termed pAdAc.CMV- 
LDL. 

This plasmid is useful in gene therapy of FH alone, 
or preferably, in the form of a conjugate prepeured as 
25 described herein to substitute a normal LDL gene for the 
abnormal allele responsible for the gene. 
A. Ex vivo Gene Therapy 

Ex vivo gene therapy can be performed by 
harvesting and establishing a primary culture of 
30 hepatocytes from a patient. Known techniques may be used 
to Isolate and transduce the hepatocytes with the above 
vector (s) bearing the LDL receptor gene(s). For example, 
techniques of collagenase perfusion developed for rabbit 
liver can be adapted for human tissue and used in 
35 transduction. Following transduction, the hepatocytes 
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10 



15 



20 



25 



30 



are removed froir the tissue culture plates and reinfused 
into the patient using known techniques, e.g. via a 
catheter placed into the inferior mesenteric vein. 

Desirably, the in vl.o approach to gene 
therapy, e.g. liver-directed, involves the use of the 
vectors and vector conjugates described above. A 
preferred treatment involves infusing a vector LDL 
conjugate of this invention into the peripheral 
circulation of the patient. The patient is then 
evaluated for change in serum lipids and liver tissues. 

The virus or conjugate can be used to infect 
hepatocytes in vivo by direct injection into a peripheral 
or portal vein (lo'-10« pfu/kg) or retrograde into the 
biliary tract (same dose). This effects gene transfer 
into the majority of hepatocytes. 

Treatments are repeated as necessary, e.g. 
weekly. Administration of a dose of virus equivalent to 
an MOI of approximately 20 (i.e. 20 pfu/hepatocyte) is 
anticipated to lead to high level gene expression xn the 
majority of hepatocytes. 

All references recited above are incorporated herein 
by reference. Numerous modifications and variations of 
the present invention are included in the above- 
identified specification and are expected to be obvious 
to one of Skill in the art. Such modifications and 
alternations to the compositions and processes of the 
present invention, such as various modifications to the 
PAC sequences or the shuttle vectors, or to other 
sequences of the vector, helper virus and minigene 
components, are believed to be encompassed in the scope 
of the claims appended hereto. 
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SEQUENCE LISTING 



(1) GENERAL INFOKMATION: 

(i) APPLICANT: Trustees of the University of Pennsylvania 

Wilson, James M. 
Fisher, Krishna J. 
Chen, Shu- Jen 
Weitzman, Matthew 

(ii) TITLE OP INVENTION: Improved Adenovirus and Methods 

of Use Thereof 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hovson and Howson 

(B) STREET: Spring House Corporate Cntr, PO Box 457 

(C) CITY: Spring House 

(D) STATE: Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19477 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/331,381 

(B) FILING DATE: 28-0CT-1994 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Bak, Mary E. . 

(B) REGISTRATION NUMBER: 31,215 

(C) REFERENCE/ DOCKET NUMBER: GNVPN.008PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-540-9200 

(B) TELEFAX: 215-540-5818 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GAACTCGAGC AGCTGAAGCT T6AATTCCAT CATCAATAAT ATACCTTATT 
TTGGATTGAA GCCAATATGA TAATGAGGGG GTGGAGTTTG TGACGT6G06 
CGGGGCGTGG GAACGGGGCG GGTGACGTA6 GTTTTAGGGC GGAGTAACTT 
GTATGTGTTG GGAATTGTAG TTTTCTTAAA ATG66AAGTT AC6TAACGTG 
GGAAAACGGA AGTGACGATT TGAGGAAGTT GTGGGTTTTT TGGCTTTC6T 
TTCTGGGCGT AGGTTCGCGT GCGGTTTTCT GGGTGTTTTT TGTGGACTTT 
AACCGTTACG TCATTTTTTA GTCCTATATA TACTCGCTCT GCACTTGGCC 
CTTTTTTACA CTGT6ACTGA TTGAGCTGGT 6CCGTGTCGA GTGGTGTTTT 
TTTAATAGGT TTTCTTTTTT ACTGGTAAGG CTGACTGTTA GGCTGCCGCT 
GTGAAGCGCT GTATGTTGTT CTGGAGCGGG AGGGTGCTAT TTTGCCTA66 
CAGGAGGGTT TTTCAGGTGT TTATGTGTTT TTCTCTCCTA TTAATTTPGT 
TATACCTCCT ATGGGGGCTG TAATGTTGTC TCTACGCCTG CGGGTAT6TA 
TTCCCCCCAA GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA 
ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT 
GTTGTTAACT TGTTTATTGC AGCTTATAAT GQTTACAAAT AAAGCAATAG 
CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGT6 
GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGGAT CCCCGCGGCC 
GCCTAGAGTC GAGGCCGAGT TTGTCAGAAA GCAGACCAAA CAGCGGTTGG 
AATAATAGCG AGAACAGAGA AATAGCGGCA AAAATAATAC CCGTATCACT 
TTTGCTGATA TGGTTGATGT CATGTAGCCA AATCGGGAAA AACGGGAAGT 



50 

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

850 

900 

950 

1000 
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AGGCTCCCAT GATAAAAAA6 TAAAAGAAAA 
AAGTTTGTGT TTTTTAAATA GTACATAATG 
GGGCA6ACAT 6GCCTGCCC6 6TTATTATTA 
GGTAATGGTA GCGACC6GCG CTCAGCTGTA 
CTCCAGGA6T CGTCGCCACC AATCCCCATA 
CCATGTGCCT TCTTCCGCGT GCAGCAGATG 
GTTGCT6TTG ACTGTAGCGG CTGATGTTGA 
TGGTGTGGGC CATAATTCAA TTCGCGCGTC 
GCTCGGGAAG AC6TACG6G6 TATACATGTC 
G6TCAAAACA GGCGGCAGTA AGGCGGTCG6 
AATCCGAGCC AGTTTACCCG CTCTGCTACC 
6CCAATCCGC GCCGGATGCG GTGTATCGCT 
TAATC6CCAT TTGACCACTA CCATCAATCC 
AATAAGGTTT TCCCCTGATG CT6CCACGCG 
CGCATCAGCA AGTGTATCTG CCGTGCACTG 
GGTAATGGCC CGCCGCCTTC CAGCGTTCGA 
CGGGTCGCTT CACTTACGCC AATGTCGTTA 
CTGATCGCGC AGCGGCGTCA GCAGTTGTTT 
GTGAAAGAAA GCCTGACTGG CGGTTAAATT 
TCGATGCAAA AATCCATTTC GCTGGTGGTC 
CGCGGCGGGG AGCGTCACAC TGAGGTTTTC 
AGGCGCTGAT GTGCCCGGCT TCTGACCATG 
ACGCGTACTG TGAGCCAGAG TTGCCCGGC6 
AGGCAGTTCA ATCAACTGTT TACCTTGTGG 
CACCGCTTGC CAGCGGCTTA CCATCCAGCG 
TCGTTATCGC TATGACGGAA CAGGTATTCG 



AGAATAAACC 


6AACATCCAA 


1050 


6ATTTCCTTA 


CGCGAAATAC 


1100 


TTTTT6ACAC 


CAGACCAACT 


1150 


A.TCCGCCGA 


TACTGACGGG 


1200 


T6GAAACC6T 


CGATATTCAG 


1250 


GC6AT6GCT6 


CTTTCCATCA 


1300 


ACTGGAAGTC 


GCCGCGCCAC 


1350 


CC6CAGC6CA 


GACCGTTTTC 


1400 


TGACAATGGC 


AGATCCCAGC 


1450 


gatAgttttc 


TTGCGGCCCT 


1500 


TGCGCCAGCT 


GGCAGTTCAG 


1550 


CGCCACTTCA 


ACATCAACG6 


1600 


GGTAGGTTTT 


CCGGCTGATA 


1650 


T6AGCGGTC6 


TAATCA6CAC 


1700 


CAACAACGCT 


GCTTC6GCCT 


1750 


CCCAGGCGTT 


AGGGTCAATG 


1800 


TCCAGCGGTG 


CACGGGTGAA 


1850 


TTTATOGCCA 


ATCCACATCT 


1900 


GCCAA(3GCTT 


ATTACCCA6C 


1950 


AGATGCG6GA 


TGGCGT6G6A 


2000 


CGCCAGACGC 


CACTGCTGCC 


2050 


CGGTCGC6TT 


CGGTTGCACT 


2100 


CTCTCCGGCT 


GCGGTAGTTC 


2150 


AGCGACATCC 


AGAGGCACTT 


2200 


CCACCATCCA 


GTGCAG6AGC 


2250 


CT6GTCACTT 


CGATGGTTTG 


2300 
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CCCGGATAAA 
GCGCTGGATG 
TGGCGATCGT 
GTTGCCGTTT 
CGAAGCCGCC 
GCGAAACCGC 
CAGCGGGCGC 
TCGGCACAGC 
CAAATAATAT 
CGGGCGGGAA 
TAGCGCCGTG 
TGATTACGAT 
TAGCCAGCGC 
TTTCAATATT 
GTGTACCACA 
GTTGTTCTGC 
TGACCTGACC 
AGCAACGGCT 
GCGGAAACCG 
TGTGCAGTTC 
AGTTTCGGGT 
ACCACGCTCA 
CCTGCGTTTC 
AACTCGCCGC 
ATCATTAAAG 
TATGCAGCAA 



CGGAACTGGA 
CGGCGTGCG6 
TCGGCGTATC 
TCATCATATT 
CTGTAAACGG 
CAAGACTGTT 
GTCTCTCCAG 
CGGGAAGGGC 
CGGTGGCCGT 
GGATCGACAG 
GCCTGATTCA 
CGCGCTGCAC 
GGATCATCGG 
GGCTTCATCC 
GCGGATGGTT 
TTCATCAGCA 
ATGCAGAGGA 
TGCCGTTCAG 
ACATCGCAGG 
AACCACCGCA 
TTTCGACGTT 
TCGATAATTT 
ACCCTGCCAT 
ACATCTGAAC 
CGAGTGGCAA 
CGAGACGTCA 



62 

AAAACTGCTG 
TCGGCAAAGA 
GCCAAAATCA 
TAATCAGCGA 
GGATACTGAC 
ACCCATCGCG 
GTAGCGAAAG 
TGGTCTTCAT 
GGTGTCGGCT 
ATTTGATCCA 
TTCCCCAGCG 
CATTCGCGTT 
TCAGACGATT 
ACCACATACA 
CGGATAATGC 
GGATATCCTG 
TGATGCTCGT 
CAGCAGCAGA 
CTTCTGCTTC 
CGATAGAGAT 
CAGACGTAGT 
CACCGCCGAA 
AAAGAAACTG 
TTCAGCCTCC 
CATGGAAATC 
CGGAAAATGC 



CTGGTGTTTT 
CCAGACCGTT 
CCGCCGTAAG 
CTw^ATCCACC 
GAAACGCCTG 
TGGGCGTATT 
CCATTTTTTG 
CCACGCGCGC 
CCGCCGCCTT 
GCGATACAGC 
ACCAGATGAT 
ACGCGTTCGC 
CATTGGCACC 
GGCCGTAGC6 
GAACAGCGCA 
CACCATCGTC 
GACGGTTAAC 
CCATTTTCAA 
AATCAGCGTG 
TCGGGATTTC 
GTGAC6CGAT 
AGGCGCGGTG 
TTACCCGTAG 
AGTACAGCGC 
GCrrGATTTGT 
CGCTCATCCG 



GCTTCCGTCA 
CATACAGAAC 
CCGACCACGG 
CAGTCCCAGA 
CCAGTATTTA 
CGCAAAG6AT 
ATGGACCATT 
GTACATCGGG 
CATACTGCAC 
GCGTCGTGAT 
CACACTCGGG 
TCATCGCCGG 
ATGCCGTGGG 
GTCGCACAGC 
CGGCGTTAAA 
TGCTCATCCA 
GCCTCGAATC 
TCCGCACCTC 
CCGTCGGCGG 
GGCGCTCCAC 
CGGCATAACC 
CCGCTGGCGA 
GTAGTCACGC 
GGCTGAAATC 
GTAGTCGGTT 
CCACATATCC 



2350 

2400 

2450 

2500 

2550 

2600 

2650 

2700 

2750 

2800 

2850 

2900 

2950 

3000 

3050 

3100 

3150 

3200 

3250 

3300 

3350 

3400 

3450 

3500 

3550 

3600 
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TGATCTTCCA GATAACTGCC GTCACTCCAA 
6CGGTTTTCT CC6GC6CGTA AAAATGC6CT 
AACGACTGTC CTGGCC6TAA CCGACCCAGC 
AACGCC6AGT TAACGCCATC AAAAATAATT 
CCAGCTTTCA TCAACATTAA ATGTGAGCGA 
CC6TGGGAAC AAACGGCGGA TTGACC6TAA 
TAGATGGGCG CATCGTAACC GTGCATCTGC 
AGTATC6GCC TCA6GAA6AT CGCACTCCAG 
CTGGTGCCG6 AAACCAGGCA AAGCGCCATT 
TGTTGG6AA6 G6C6ATCGGT GCGGGCCTCT 
CAAAGGGGGA T6TGCTGCAA GGCGATTAAG 
CCCA6TCAC6 ACGTTGTAAA ACGACGG6AT 
CTGGTGTCCA GACCAAT6CC TCCCAGACC6 
GTTGGTCAAft GTAAAC6ACA T6GTGACTTC 
TTTCGATCCC CGGGAATTGC GGCCGCGGGT 
GOIGAAGTAA CACTTCCGTA CAGGCCTAGA 
agga6ca6TT CTTTGATTTG CACCACCACC 
AAGACAAAAA GACTAAACTT ACCAGTTAAC 
C6AGTACCG6 ATCCTCTAGA GTCCGGAGGC 
CTATGGAGGT CAAAACAGCG TGGATGGCGT 
CACTAAACGA GCTCTGCTTA TATAGACCTC 
CCATTT6CGT CAATGG66C6 6A6TT6TTAC 
TGATTTT6GT GCCAAAACAA ACTCCCATTG 
TGGAAATCCC CGTGAGTCAA ACCGCTATCC 
AAAACCGCAT CACCATGGTA ATAGCGATGA 
CAAGTAGGAA AGTCCCATAA GGTCATGTAC 



CGCAGCACCA 


TCACCGCGAG 


3650 


CAGGTCAAAT 


TCAGACG6CA 


3700 


6CCCGTT6CA 


CCACAGATGA 


3750 


CvaCGTCTGGC 


CTTCCT6TA6 


3800 


GTAACAACCC 


GTCGGATTCT 


3850 


TGGGATAGGT 


TACGTTGGTG 


3900 


CAGTTTGAGG 


GGACGACGAC 


3950 


CCA6CTTTCC 


6GCACC6CTT 


4000 


CGCCATTCAG 


GCTGC6CAAC 


4050 


TCGCTATTAC 


GCCAGCTGGC 


4100 


TTGGGTAACG 


CCAGGGTTTT 


4150 


C6C6CTTGA6 


CA6CTCCTTG 


4200 


6CAAC6AAAA 


TCACGTTCTT 


4250 


TTTTTTGCTT 


TAGCAG6CTC 


4300 


ACAATTCCGC 


AGCTTTTAGA 


4350 


AGTAAA6GCA 


ACATCCACTG 


4400 


6GATCC6GGA 


CCT6AAATAA 


4450 


TTTCT66TTT 


TTCAGTTCCT 


4500 


T66ATC6GTC 


CCGGTCTCTT 


4550 


CTCCAGGCGA 


TCTGACG6TT 


4600 


CCACCGTACA 


CGCCTACCGC 


4650 


6ACATTTTGG 


AAAGTCCC6T 


4700 


AC6TCAATGG 


GGTGGAGACT 


4750 


ACGCCCATTG 


ATGTACT6CC 


4800 


CTAATACGTA 


GATGTACT6C 


4850 


TGGGCATAAT 


GCCAGGCGGG 


4900 
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CCATTTACCG 
CTTGATGTAC 
CGTCAATGGA 
TGACGTCAAT 
CCGTAAGTTA 
CAAATATTAC 
CTTATTCAGT 
AATTTACTAC 
GTACACCGGC 
ATGTGTTCCG 
ACCCGCCCCG 
TTATCATATT 
CGAATTCATC 
TTAATTTCGA 
GGGGAGAGGC 
ACTCGCTGCG 
AAGGCGGTAA 
CATGTGAGCA 
TGCTGGCGTT 
CGACGCTCAA 
GGCGTTTCCC 
CGCTTACCGG 
TCTCAATGCT 
CAAGCTGGGC 
TATCCGGTAA 
CCACTGGCAG 



TCATTGACGT 
TGCCAAGTGG 
AAGTCCCTAT 
GGGCGGGGGT 
TGTAACGACC 
GCGCTATGAG 
TTTCCCGCGA 
AACATCCGCC 
GCACACCAAA 
CCACACTTGC 
TTCCCACGCC 
GGCTTCAATC 
GATGATATCA 
TAAGCCAGGT 
GGTTTGCGTA 
CTCGGTCGTT 
TACGGTTATC 
AAAGGCCAGC 
TTTCCATAG6 
GTCAGAGGTG 
CCTGGAAGCT 
ATACCTGTCC 
CACGCTGTAG 
TGTGTGCACG 
CTATCGTCTT 
CAGCCACTGG 



64 

CAATAGGGGG 
GCAGTTTACC 
TGGCGTTACT 
CGTTGGGCGG 
TGCAGGTCGA 
TAACACAAAA 
AAATGGCCAA 
TAAAACCGC6 
AAC6TCACTT 
AACATCACAC 
CCGCGCCACG 
CAAAATAAGG 
GATCTGCCGG 
TAACCTGCAT 
TTGGGCGCTC 
CGGCTGCGGC 
CACAGAATCA 
AAAAGGCCAG 
CTCCGCCCCC 
GCGAAACCCG 
CCCTCGTGCG 
GCCTTTCTCC 
GTATCTCAGT 
AACCCCCCGT 
GAGTCCAACC 
TAACAGGATT 



CGTACTTGGC 
GTAAATACTC 
ATGGGAACAT 
TC^.GCCAGGC 
CTCTAGAGGA 
TTATTCAGAT 
ATCTTACTCG 
CGAAAATTGT 
TTGCCACATC 
TTCCGCCACA 
TCACAAACTC 
TATATTATTG 
TCTCCCTATA 
TAATGAATCG 
TTCCGCTTCC 
GAGCGGTATC 
GGGGATAACG 
GAACCGTAAA 
CTGACGAGCA 
ACAGGACTAT 
CTCTCCTGTT 
CTTCGGGAAG 
TCGGTGTAGG 
TCAGCCCGAC 
CGGTAAGACA 
AGCAGAGCGA 



ATATGATACA 
CACCCATTGA 
ACGTCATTAT 
GGGCCATTTA 
TCTCCCTAGA 
TTCACTTCCT 
GTTACGCCCA 
Ca^CTTCCTGT 
CGTCGCTTAC 
CTACTAC6TC 
CACCCCCTCA 
ATGATGCTAG 
GTGAGTCGTA 
GCCAACGCGC 
TCGCTCACTG 
AGCTCACTCA 
CAGGAAAGAA 
AAGGCCGCGT 
TCACAAAAAT 
AAAGATACCA 
CCGACCCTGC 
CGTGGCGCTT 
TCGTTCGCTC 
CGCTGCGCCT 
CGACTTATCG 
GGTATGTAGG 



4950 

5000 

5050 

5100 

5150 

5200 

5250 

5300 

5350 

5400 

5450 

5500 

5550 

5600 

5650 

5700 

5750 

5800 

5850 

5900 

5950 

6000 

6050 

6100 

6150 

6200 
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C6GTGCTACA 


GAGTTCTTGA 


AGTGGTGGCC 


TAACTACGGC 


TACACTAGAA 


6250 


G6ACAGTATT 


TGGTATCTGC 


GCTCTGCTGA 


AGCCAGTTAC 


CTTCGGAAAA 


6300 


AGJ^TTGGTA 


GCTCTTGATC 


CGGCAAACAA 


ACCACCGCT6 


CTA6CG6T6G 


6350 


TTTTTTTGTT 


TGCAAGCAGC 


A6ATTAC6C6 


CmGAAAAAAA 


GGATCTCAAG 


6400 


AAGATCCTTT 


GATCTTTTCT 


ACGGGGTCTG 


ACGCTCAGT6 


GAAC6AAAAC 


6450 


TCACGTTAAG 


GGATTTTGGT 


CATGAGATTA 


TCAAAAAGGA 


TCTTCACCTA 


6500 


GATCCTTTTA 


AATTAAAAAT 


GAAGTTTTAA 


ATCAATCTAA 


AGTATATATS 


6550 


A6TAAACTTG 


GTCTGACA6T 


TACGAAT6CT 


TAATCA6TGA 


66CACCTATC 


6600 


TCAGC6ATCT 


GTCTATTTCG 


TTCATCCATA 


GTT6CCT6AC 


TCCCCGTCGT 


6650 


GTAGATAACT 


ACGATAC6GG 


AGGGCTTACC 


ATCTGGCCCC 


A6T6CTGCAA 


6700 


TGATACC6CG 


AGACCCACGC 


TCACCGGCTC 


CAGATTTATC 


AGCAATAAAC 


6750 


CA6CCAGCC6 


6AA6G6CCGA 


GOGCAGAAGT 


66TCCT6CAA 


CTTTATCC6C 


6800 


CTCCATCCA6 


TCTATTAATT 


GTTGCCG6GA 


AGCTAGA6TA 


A6TAGTTCGC 


6850 


CAGTTAATA6 


TTT6C6CAAC 


GTT6TTGCCA 


TT6CTACA66 


CATC6TGGTC 


6900 


TCACGCTCGT 


CGTTTGGTAT 


G6CTTCATTC 


AGCTCC6GTT 


CCCAAC6ATC 


6950 


AAGGCGAGTT 


ACATGATCCC 


CCATGTTGTG 


CAAAAAAGCG 


GTTAGCTCCT 


7000 


TCGGTCCTCC 


GATCGTTGTC 


AGAA6TAAGT 


TGGCCGCAGT 


GTTATCACTC 


7050 


ATGGTTATG6 


CAGCACTGCA 


TAATTCTCTT 


ACTGTCATGC 


CATCC6TAA6 


7100 


ATGCTTTTCT 


GTGACTGGT6 


AGTACTCAAC 


CAAGTCATTC 


T6A6AATAGT 


7150 


GTATGCGGCG 


ACCGAGTTGC 


TCTTGCCCGG 


CGTCAATAC6 


GGATAATACC 


7200 


GCGCCACATA 


GCAGAACTTT 


AAAAGTGCTC 


ATCATTGGAA 


AACGTTCTTC 


7250 


GGGGCGAAAA 


CTCTCAAGGA 


TCTTACC6CT 


GTTGAGATCC 


A6TTC6ATGT 


7300 


AACCCACTCX; 


T6CACCCAAC 


TGATCTTCAG 


CATCTTTTAC 


TTTCACCA6C 


7350 


GTTTCTGGGT 


GAGCAAAAAC 


AGGAAGGCAA 


AAT6CC6CAA 


AAAAGG6AAT 


7400 


AAGGGCGACA 


CGGAAATGTT 


GAATACTCAT 


ACTCTTCCTT 


TTTCAATATT 


7450 


ATTGAAGCAT 


TTATCAGGGT 


TATTGTCTCA 


TGAGCGGATA 


CATATTTGAA 


7500 
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TGTATTTAGA 

AGTGCCACCT 

AAAATAGGC6 

GGTGAAAACC 

GTAAGCGGAT 

TTGGCGGGTG 

CTGAGAGTGC 

ATACATAACC 



AAAATAAACA 

GACGTCTAAG 

TATCACGAG6 

TCTGACACAT 

GCCGGGAGCA 

TCGGGGCTGG 

ACCATATGGA 

TTATGTATCA 



66 

AATAGGGGTT 
AAACCATTAT 
CCCTTTCGTC 
GCAGCTCCCG 
GACAAGCCCG 
CTTAACTATG 
CATATTGTCG 
TACACATACG 



CCGCGCACAT 

TATCATGACA 

TCGCGCGTTT 

GAJACGGTCA 

TCAGGGCGCG 

CGGCATCAGA 

TTAGAACGCG 

ATTTAGGTGA 



TTCCCCGAAA 
TTAACCTATA 
CGGTGATGAC 
CAGCTTGTCT 
TCAGCGG6TG 
GCAGATTGTA 
GCTACAATTA 
CACTATA 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unkno%m 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GAATTCGCTA GCTAGCGGGG GAATACATAC CCGCAGGCGT 
TTACAGCCCC CATAGGAGGT ATAACAAAAT TAATAGGAGA 
TAAACACCTG AAAAACCCTC CTGCCTAGGC AAAATAGCAC 
CAGAACAACA TACAGCGCTT CACAGCGGCA GCCTAACAGT 
AGTAAAAAAG AAAACCTATT AAAAAAACAC CACTCGACAC 
CAATCAGTCA CAGTGTAAAA AAGGGCCAAG TGCAGAGCGA 
GACTAAAAAA TGACGTAACG GTTAAAGTCC ACAAAAAACA 
CGCACGCGAA CCTACGCCCA GAAACGAAAG CCAAAAAACC 
TCAAATCGTC ACTTCCGTTT TCCCACGTTA CGTAACTTCC 
AAACTACAAT TCCCAACACA TACAAGTTAC TCCGCCCTAA 
ACCCGCCCCG TTCCCACGCC CCGCGCCACG TCACAAACTC 



7550 

7600 

7650 

7700 

7750 

7800 

7850 

7897 



AGAGACAACA 
GAAAAACACA 
CCTCCCGCTC 
CAGCCTTACC 
GGCACCAGCT 
GTATATATAG 
CCCAGAAAAC 
CACAACTTCC 
CATTTTAAGA 
AACCTACGTC 
CACCCCCTCA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
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TTATCATATT 6GCTTCAATC CAAAATAA6G 
CATCATCAAT AATATACCTT ATTTTGGATT 
GGG6TG6AGT TTGT6ACGT6 GCGCX»»»K:G 
TAGTA6TGT6 GCX^GAAGTGT GATGTTGCAA 
6CGACGGATG TGGCAAAAGT GACGTTTTTG 
GAAGTGACAA TTTTC6CGCG GTTTTAGGCG 
CGTAACCGAG TAAGATTTGG CCATTTTCGC 
AGT6AAATCT GAATAATTTT GTGTTACTCA 
6GGA6ATCAG CCTGCA6GTC GTTACATAAC 
GGCTGACCX^C CCAAC6ACCC CCXSCCCATTG 
TCCCATAGTA ACGCCAATAG GGACTTTCCA 
ATTTACGGTA AACT6CCCAC TTGGCA6TAC 
A6TAC6CCCC CTATT6AC6T CAATGACGGt 
TGCCCA6TAC AT6ACCTTAT GGGACTTTCC 
TATTAGTCAT CGCTATTACC ATGGTGATGC 
6GGCGTGGAT AGCGGTTTGA CTCACGG6GA 
TGACGTCAAT GGGAGTTTGT TTTG6CACCA 
AATGTpGTAA CAACTCC6CC CCATT6ACGC 
CGGTGG6A66 TCTATATAA6 CAGA6CTCGT 
CTGGAGACGC CATCCACGCT GTTTTGACCT 
GATCCAGCCT CCGGACTCTA GAGGATCCGG 
CAGAAAGTTA ACTGGTAAGT TTAGTCTTTT 
GGATCCGGT6 GTGGTGCAAA TCAAAGAACT 
TTTACTTCTA GGCCTGTACG GAAGTGTTAC 
AATTGTACCC GCGGCCGCAA TTCCCGGGGA 
AAAAAAGAAG TCACCATGTC GTTTACTTTG 



TATATTATTG 


ATGATGCTA6 


600 


GAA6CCAATA 


TGATAAT6AG 


650 


TGGGAACGGG 


GCGGGTGAC6 


700 


G-^GrGGCGGA 


ACACATGTAA 


750 


GTGTGCGCCG 


GTGTACAGAG 


800 


GATGTTGTAG 


TAAATTTGGG 


850 


GGGAAAACTG 


AATAA6AGGA 


900 


TAGCGCGTAA 


TATTT6TCTA 


950 


TTAC6GTAAA 


T66CCC6CCT 


1000 


ACGTCAATAA 


TGACGTAT6T 


1050 


TTGACGTCAA 


TGGGTGGAGT 


1100 


ATCAA6T6TA 


TCATATGCCA 


1150 


AAATG6CCC6 


CCTGGCATTA 


1200 


TACTT66CA6 


TACATCTAC6 


1250 


GGTTTTGGCA 


GTACATCAAT 


1300 


TTTCCAAGTC 


TCCACCCCAT 


1350 


AAATCAACGG 


GACTTTCCAA 


1400 


AAAT6G6C6G 


TA6GC6TGTA 


1450 


TTAGTGAACC 


GTCAGATCGC 


1500 


CCATAGAA6A 


CACCGGGACC 


1550 


TACTCGAGGA 


ACTGAAAAAC 


1600 


TGTCTTTTAT 


TTCAGGTCCC 


1650 


GCTCCTCAGT 


GGATGTTGCC 


1700 


TTCTGCTCTA 


AAAGCTGCGG 


1750 


TCGAAAGAGC 


CTGCTAAAGC 


1800 


ACCAACAAGA 


ACGTGATTTT 


1850 
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CGTTGCCGGT 
A6CGCGATCC 
ACCCAACTTA 
TAGCGAAGAG 
ATGGCGAATG 
AGCTGGCTGG 
AAACTGGCAG 
ATCCCATTAC 
TGTTACTCGC 
GACGCGAATT 
ACGGGCGCTG 
GACCTGAGCG 
GCTGCGTTGG 
TGAGCGGCAT 
ATCAGCGATT 
TGTACTGGAG 
GGGTAACAGT 
GCGCCTTTCG 
CGTCACACTA 
TCCCGAATCT 
CTGATTGAAG 
AAATGGTCT6 
ACCGTCACGA 
ATGGTGCAGG 
CTGTTCGCAT 
ACGGCCTGTA 



CT6GGAGGCA 
CGTCGTTTTA 
ATCGCCTTGC 
GCCCGCACCG 
GCGCTTTGCC 
AGTGCGATCT 
ATGCACGGTT 
GGTCAATCC6 
TCACATTTAA 
ATTTTTGATG 
GGTCGGTTAC 
CATTTTTACG 
AGTGACGGCA 
TTTCCGTGAC 
TCCATGTTGC 
GCTGAAGTTC 
TTCTTTATGG 
GCGGTGAAAT 
CGTCTGAACG 
CTATCGTGCG 
CAGAAGCCTG 
CTGCTGCTGA 
GCATCATCCT 
ATATCCTGCT 
TATCCGAACC 
TGTGGTGGAT 



68 

TTGGTCTGGA 
CAACGTCGTG 
AGCACATCCC 
ATCGCCCTTC 
TGGTTTCCGG 
TCCTGAGGCC 
ACGATGCGCC 
CCGTTTGTTC 
TGTTGATGAA 
GCGTTAACTC 
GGCCAGGACA 
CGCCGGAGAA 
6TTATCTGGA 
GTCTCGTTGC 
CACTCGCTTT 
AGATGTGCGG 
CAGGGTGAAA 
TATCGATGAG 
TCGAAAACCC 
GTGGTTGAAC 
CGATGTCGGT 
ACGGCAAGCC 
CTGCATGGTC 
GATGAAGCAG 
ATCCGCTGTG 
GAAGCCAATA 



CACCAGCAAG 
ACTGGGAAAA 
CCTTTCGCCA 
CCAACAGTTG 
CACCAGAAGC 
GATACTGTCG 
CATCTACACC 
CCACGGAGAA 
AGCTGGCTAC 
GGCGTTTCAT 
GTCGTTTGCC 
AACCGCCTCG 
AGATCAGGAT 
TGCATAAACC 
AATGATGATT 
CGAGTTGCGT 
CGCAGGTCGC 
CGTGGTGGTT 
GAAACTGTGG 
TGCACACCGC 
TTCCGCGAGG 
GTTGCTGATT 
AGGTCATGGA 
AACAACTTTA 
GTACACGCTG 
TTGAAACCCA 



GAGCTGCTCA 
CCCTGGCGTT 
GCTG6CGTAA 
CGCAGCCTGA 
6GTGCCG6AA 
TCGTCCCCTC 
AACGTAACCT 
TCCGACGGGT 
AGGAAGGCCA 
CTCTGGTGCA 
GTCTGAATTT 
CGGTGATGGT 
ATGTGGCGGA 
GACTACACAA 
TCAGCCGC6C 
GACTACCTAC 
CAGCGGCACC 
ATGCCGATCG 
AGCGCCGAAA 
CGACGGCACG 
TGCGGATTGA 
CGAGGCGTTA 
TGAGCAGACC 
ACGCCGTGCG 
TGCGACCGCT 
CGGCATGGTG 



1900 

1950 

2000 

2050 

2100 

2150 

2200 

2250 

2300 

2350 

2400 

2450 

2500 

2550 

2600 

2650 

2700 

2750 

2800 

2850 

2900 

2950 

3000 

3050 

3100 

3150 
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69 

CCAAT6AATC GTCTGACCGA TGATCCGC6C 
ACGCGTAAC6 C6AATGGTGC A6C6C6ATCG 
TCTGCTCGCT GGG6AATGAA TCAGGCCACG 
TATC6CT6GA TCAAATCTGT C6ATCCTTCC 
CGGCGGAGCC GACACCAC66 CCACC6ATAT 
GCGTGGAT6A A6ACCAGCCC TTCCC66CT6 
AAAT6GCTTT C6CTACCT66 AGAGACGC6C 
CGCCCACGC6 ATG66TAACA GTCTTGGC6G 
CGTTTCGTCA 6TATCCCCGT TTACAGG6CG 
GATCAGTCGC TGATTAAATA TGATGAAAAC 
CGGCGGTGAT TTTGGCGATA CGCCGAACX3A 
GTCTGGTCTT T6CCGACCGC ACGCCGCATC 
CACCAGCAGC A6TTTTTCCA GTTCCGTTTA 
6ACCAGC6AA TACCTGTTCC GTCATAGCGA 
TGGTGGCGCT 6GATGGTAAG CCGCTGGCAA 
GTCGCTCCAC AAGGTAAACA GTTGATTjSAA 
6GAGAGCGCC GGGCAACTCT GGCTCACA6T 
CGACCGCATG GTCAGAA6CC G6GCACATCA 
CTGGCGGAAA ACCTCAGTGT GACGCTCCCC 
GCATCTGACC ACCAGCGAAA TGGATTTTTG 
GTTGGCAATT TAACCGCCAG TCAGGCTTTC 
GATAAAAAAC AACTGCT6AC GCCGCTGCGC 
GCTGGATAAC GACATTGGC6 TAAGTGAAGC 
CCTGGGTCGA ACGCTGGAAG GCGGCGGGCC 
TTGTTGCAGT GCACGGCAGA TACACTTGCT 
CGCTCACGCG TGGCAGCATC AGGGGAAAAC 



TGGCTACCGG 


CGATGAGCGA 


3200 


TAATCACCC6 


AGTGTGATCA 


3250 


GCGCTAATCA 


C6AC6C6CTG 


3300 


Cv^CCCGGTGC 


AGTAT6AAGG 


3350 


TATTTGCCC6 


ATGTACGCGC 


3400 


TGCCGAAAT6 


GTCCATCAAA 


3450 


CCGCTGATCC 


TTT6C6AATA 


3500 


TTTCGCTAAA 


TACTG6CAG6 


3550 


GCTTCGTCTG 


G6ACTGG6T6 


3600 


GGCAACCCGT 


GGTCGGCTTA 


3650 


TCGCCAGTTC 


TGTATGAACG 


3700 


CAGCGCTGAC 


66AAGCAAAA 


3750 


TCCGGGCAAA 


CCATCGAAGT 


3800 


TAACGAGCTC 


CTGCACT66A 


3850 


GCGGTGAAGT 


GCCTCT66AT 


3900 


CTGCCTGAAC 


TACCGCAGCC 


3950 


ACGCGTAGTG 


CAACC6AACG 


4000 


GCGCCTGGCA 


GCA6TGGCGT 


4050 


6CCGCGTCCC 


AC6CCATCCC 


4100 


CATCGAGCTG 


GGTAATAAGC 


4150 


TTTCACAGAT 


GTGGATTG6C 


4200 


GATCAGTTCA 


CCC6TGCACC 


4250 


GACCCGCATT 


GACCCTAAC6 


4300 


ATTACCAGGC 


CGAAGCAGC6 


4350 


GAT6CGGTGC 


TGATTACGAC 


4400 


CTTATTTATC 


AGCCGGAAAA 


4450 



wo 96/13597 



PCTAJS95/14017 



CCTACCGGAT 
GTGGCGA6CG 
GGCGCAGGTA 
ACTATCCCGA 
TTGTCAGACA 
CTGCGGGACG 
TCCAGTTCAA 
CATCGCCATC 
TTTCCATATG 
CGGAATTACA 
CAAAAATAAT 
GAAATCCATT 
TATTCTTTTT 
CCGATTTGGC 
TATTTTTGCC 
GTCTGCTTTC 
AGACATGATA 
AGTGAAAAAA 
GTAACCATTA 
TTTTATGTTT 
TAGAGTCGAC 
6CTTCCGGCG 
GGTAGATGAC 
CCAGCCTAAC 
GCCTCGGCGA 
ATACCTTGTC 



TGATGGTAGT 
ATACACCGCA 
GCAGAGCGGG 
CCGCCTTACT 
TGTATACCCC 
CGCGAATTGA 
CATCAGCCGC 
TGCTGCACGC 
GGGATTGGTG 
GCTGAGCGCC 
AATAACCGGG 
ATGTACTATT 
CTTTTACTTT 
TACATGACAT 
GCTATTTCTC 
TGACAAACTC 
AGATACATTG 
ATGCTTTATT 
TAAGCTGCAA 
CAGGTTCAGG 
GACGCGAGGC 
GCATCGGGAT 
GACCATCAGG 
TTCGATCACT 
GCACATGGAA 
TGCCTCCCCG 



70 

GGTCAAATGG 
TCCGGCGCG6 
TAAACTGGCT 
GCCGCCTGTT 
GTACGTCTTC 
ATTATGGCCC 
TACAGTCAAC 
GGAAGAAGGC 
GCGACGACTC 
GGTCGCTACC 
CAGGCCATGT 
TAAAAAACAC 
TTTATCATGG 
CAACCATATC 
TGTOCTCGCT 
GGCCTCGACT 
ATGAGTTTGG 
TGTGAAATTT 
TAAACAAGTT 
GGGAGGTGTG 
TGGATGGCCT 
GCCCGCGTTG 
GACAGCTTCA 
GGACCGCTGA 
CGGGTTGGCA 
CGTTGCGTCG 



CGATTACCGT 
ATTGGCCTGA 
CGGATTAGGG 
TTi^ACCGCTG 
CCGAGCGAAA 
ACACCAGTGG 
AGCAACTGAT 
ACATGGCTGA 
CTGGAGCCCG 
ATTACCAGTT 
CTGCCCGTAT 
AAACTTTTGG 
GA6CCTACTT 
AGCAAAAGTG 
ATTATTCCAA 
CTAGGCGGCC 
ACAAACCACA 
GTGATGCTAT 
AACAACAACA 
GGAGGTTTTT 
TCCCCATTAT 
CAGGCCATGC 
AGGATCGCTC 
TCGTCACGGC 
TGGATTGTAG 
CGGTGCATGG 



TGATGTTGAA 
ACTGCCAGCT 
CCGCAA6AAA 
GGATCTGCCA 
ACGGTCTGCG 
CGCGGCGACT 
GGAAACCAGC 
ATATCGACGG 
TCAGTATCGG 
GGTCTGGTGT 
TTCGCGTAA6 
ATGTTCGGTT 
CCCGTTTTTC 
ATACGGGTAT 
CCGCTGTTTG 
GCGGGGATCC 
ACTAGAATGC 
TGCTTTATTT 
ATTGCATTCA 
TCGGATCCTC 
GATTCTTCTC 
TGTCCAGGCA 
GCGGCTCTTA 
GATTTATGCC 
GCGCCGCCCT 
AGCCGGGCCA 



4500 

4550 

4600 

4650 

4700 

4750 

4800 

4850 

4900 

4950 

5000 

5050 

5100 

5150 

5200 

5250 

5300 

5350 

5400 

5450 

5500 

5550 

5600 

5650 

5700 

5750 
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'1 

CCTCGACCT6 AATG6AA6CC GGCGGCACCT 
CAAGAATTGG A6CCAATCAA TTCTTGCGGA 
AACCCTT66C A6AACATATC CATCGC6TCC 
GCG6CGCATC TCG6GCAGCG TTGGGTCCTG 
TGCTCCTGTC GTTGAGGACC CGGCTAGGCT 
TAGCAGAAT6 AATCACCGAT ACGCGAGCGA 
CAAAACGTCT GC6ACCTGAG CAACAACATG 
TTTC6TAAAG TCTGGAAAC6 C66AA6TCA6 
CGGATCT6CA TCGCAG6AT6 CT6CT66CTA 
TGTATTAAC6 AAGCCTTTCT CAATGCTCAC 
GTGTAGGTCG TTCGCTCOUl GCTG6GCTGT 
GCCC6ACC6C TGCGCCTTAT CC66TAACTA 
TAAGACAC6A CTTATC6CCA CT6GCA6CA6 
AGA6C6A66T AT6TAGGC6G T6CTACA6A6 
CTACGGCTAC ACTAGAAGGA CAGTATTTGG 
CAGTTACCTT CGGAAAAA6A 6TTGGTAGCT 
ACC6CTGGTA GCGGTGGTTT TTTTGTTTGC 
AAAAAAAGGA TCTCAAGAA6 ATCCTTTGAT 
CTCAGTGGAA CGAAAACTCA CGTTAAGGGA 
AAAAGGATCT TCACCTAGAT CCTTTTAAAT 
AATCTAAAGT ATATATGAGT AAACTTGGTC 
TCAGT6AGGC ACCTATCTCA 6CGATCT6TC 
GCCTGACTCC CCGTCGTGTA GATAACTACG 
TGGCCCCAGT GCTGCAATGA TACCGCGAGA 
ATTTATCAGC AATAAACCA6 CCAGCC6GAA 
CCTGCAACTT TATCCGCCTC CATCCAGTCT 



CGCTAAC6GA 


TTCACCACTC 


5800 


GAACTGTGAA 


T6C6CAAACC 


5850 


GCCATCTCXJ^ 


GCA6CCGCAC 


5900 


GCCACGGGT6 


CGCAT6ATC6 


5950 


GGCGGGGTTG 


CCTTACTGGT 


6000 


ACGTGAAGCG 


ACTGCT6CT6 


6050 


AATGGTCTTC 


GGTTTCCGTG 


6100 


C6CCCT6CAC 


CATTATGTTC 


6150 


CCCT6TGGAA 


CACCTACATC 


6200 


6CTGTAGGTA 


TCTCAGTTCG 


6250 


GTGCACGAAC 


CCCCC6TTCA 


6300 


TC6TCTT6A6 


TCCAACCCG6 


6350 


CCACT66TAA 


CAGGATTA6C 


6400 


TTCTT6AA6T 


6GT6GCCTAA 


6450 


TATCTGCGCT 


CTGCTGAAGC 


6500 


CTTGATCCGG 


CAAACAAACC 


6550 


AAGCAGCAGA 


TTACGCGCA6 


6600 


CTTTTCTAC6 


GGGTCT6ACG 


6650 


A X A JiWwX KtfA A 


6AGATTATCA 


6700 


TAAAAATGAA 


GTTTTAAATC 


6750 


TGACAGTTAC 


CAATGCTTAA 


6800 


TATTTCGTTC 


ATCCATAGTT 


6850 


ATACGGGAG6 


GCTTACCATC 


6900 


CCCACGCTCA 


CCGGCTCCAG 


6950 


GGGCCGAGCG 


CAGAAGTGGT 


7000 


ATTAATTGTT 


GCCGGGAA6C 


7050 
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TAGAGTAAGT 
CTGCAGGCAT 
TCC6GTTCCC 
AAAAGCGGTT 
CCGCAGTGTT 
GTCATGCCAT 
GTCATTCTGA 
CAACACGGGA 
ATTGGAAAAC 
GAGATCCAGT 
CTTTTACTTT 
GCCGCAAAAA 
CTTCCTTTTT 
GCGGATACAT 
CGCACATTTC 
CATGACATTA 
AA 



AGTTCGCCAG 
CGTGGTGTCA 
AACGATCAAG 
AGCTCCTTCG 
ATCACTCATG 
CCGTAAGATG 
GAATAGTGTA 
TAATACCGCG 
GTTCTTCGGG 
TCGATGTAAC 
CACCAGCGTT 
AGGGAATAA6 
CAATATTATT 
ATTTGAATGT 
CCCGAAAAGT 
ACCTATAAAA 



72 

TTAATAGTTT 
CGCTCGTCGT 
GCGA6TTACA 
GTCCTCCGAT 
GTTATGCCAG 
CTTTTCTGTG 
T6CGGCGACC 
CCACATAGCA 
GCGAAAACTC 
CCACTCGTGC 
TCTGGGTGAG 
GGCGACACGG 
GAAGCATTTA 
ATTTAGAAAA 
GCCACCTGAC 
ATAGGCGTAT 



GCGCAACGTT 
TTGGTATGGC 
TCATCCCCCA 
CGITGTCAGA 
CACTGCATAA 
ACTGGTGAGT 
GAGTTGCTCT 
CAACTTTAAA 
TCAAGGATCT 
ACCCAACTGA 
CAAAAACAGG 
AAATGTTGAA 
TCAGGGTTAT 
ATAAACAAAT 
GTCTAAGAAA 
CACGAGGCCC 



GTTGCCATTG 
TTCATTCAGC 
TGTTGTGCAA 
AGTAAGTT6G 
TTCTCTTACT 
ACTCAACCAA 
TGCCCGGCGT 
AGTGCTCATC 
TACCGCTGTT 
TCTTCAGCAT 
AA6GCAAAAT 
TACTCATACT 
TGTCTCATGA 
AGGGGTTCCG 
CCATTATTAT 
TTTCGTCTTC 



7100 

7150 

7200 

7250 

7300 

7350 

7400 

7450 

7500 

7550 

7600 

7650 

7700 

7750 

7800 

7850 

7852 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG 
GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT 



50 
100 



wo 96/13597 



PCTAJS9S/14017 



73 



CAGGG6ATAA 


CGCAGGAAAG 


AACATGTGAG 


CAAAAGGCCA 


GCAAAAGGCC 


150 


A66AACCGTA 


AAAAGGCCGC 


GTTGCTGGCG 


TTTTTCCATA 


G6CTCCGCCC 


200 


CCCT6AC6AG 


CATCACAAAA 


ATCGAC6CTC 


AAGTCA6AG6 


TG6C6AAACC 


250 


C6ACA66ACT 


ATAAAGATAC 


CAGGCGTTTC 


CCCCT66AAG 


CTCCCTC6T6 


300 


C6CTCTCCT6 


TTCCGACCCT 


GCCGCTTACC 


GGATACCTGT 


CCGCCTTTCT 


350 


CCCTTCGGGA 


AGCGTGGCGC 


TTTCTCATAG 


CTCACGCTGT 


AGGTATCTCA 


400 


GTTCG6TGTA 


6GTCGTTCGC 


TCCAAGC7TGG 


GCTGTGTGCA 


CGAACCCCCC 


450 


6TTCA6CCC6 


ACCGCTGCGC 


CTTATCCGGT 


AACTATCGTC 


TT6AGTCCAA 


500 


CCC6GTAAGA 


CACGACTTAT 


CGCCACTGGC 


AGCAGCCACT 


GGTAACAGGA 


550 


TTAGCAGA6C 


GA6GTATGTA 


GGCGGTGCTA 


CAGAGTTCTT 


GAAGTGGT6G 


600 


CCTAACTACG 


GCTACACTAG 


AAGAACAGTA 


TTTGGTATCT 


GCGCTCTGCT 


650 


GAA6CCAGTT 


ACCTTCGGAA 


AAAGAGTTGG 


TAGCTCTTGA 


TCCGGCAAAC 


700 


AAACCACCGC 


TGGTAGCGGT 


GGTrTTTTax; 


TTTGCAAGCA 


GCAGATTA06 


750 


CGCAGAAAAA 


AAGGATCTCA 


AGAAGATCCT 


TTGATCTTTT 


CTAC6GGGTC 


800 


T6ACGCTCAG 


TGGAACGAAA 


ACTCACGTTA 


AGGGATTTTG 


GTCATGAGAT 


850 


TATCAAAAAG 


GATCTTCACC 


TAGATCCTTT 


TAAATTAAAA 


ATGAAGTTTT 


900 


AAATCAATCT 


AAAGTATATA 


T6AGTAAACT 


TGGTCTGACA 


GTTACCAATG 


950 


CTTAATCAGT 


GAGGCACCTA 


TCTCAGCGAT 


CTGTCTATTT 


CGTTCATCCA 


1000 


TAGTTGCCTG 


ACTCCCCGTC 


6TGTAGATAA 


CTACGATAC6 


GGAG6GCTTA 


1050 


CCATCT6GCC 


CCAGTGCTGC 


AATGATACCG 


CCAGACCCAC 


GCTCACCG6C 


1100 


TCCAGATTTA 


TCAGCAATAA 


ACCAGCCAGC 


CGGAAGGGCC 


GAGCGCAGAA 


1150 


GTGGTCCTGC 


AACTTTATCC 


GCCTCCATCC 


AGTCTATTAA 


TTGTTGCCGG 


1200 


6AAGCTAGA6 


TAAGTAGTTC 


GCCAGTtAAT 


AGTTTGCGCA 


ACGTTGTT6C 


1250 


CATTGCTACA 


GGCATCGTGG 


TGTCACGCTC 


GTCGTTTGGT 


ATGGCTTCAT 


1300 


TCAGCTCCGC 


TTCCCAACGA 


TCAAGGCGAG 


TTACATGATC 


CCCCATGTTG 


1350 


TGCAAAAAAG 


CGGTTAGCTC 


CTTCGGTCCT 


CCGATCGTTG 


TCAGAA6TAA 


1400 
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GTTGGCCGCA 
TTACTGTCAT 
ACCAAGTCAT 
GGCGTCAATA 
TCATCATTGG 
CTGTTGAGAT 
AGCATCTTTT 
AAAATGCCGC 
ATACTCTTCC 
CATGAGCGGA 
TTCCGCGCAC 
ATTATCATGA 
TCTCGCGCGT 
CGGAGACGGT 
CGTCAGGGCG 
TGCGGCATCA 
TTAATATTTT 
TTTAACCAAT 
GCCCGAGATA 
TAAAGAACGT 
GATGGCCCAC 
GTGCCGTAAA 
CTTGACGGGG 
AAAGGAGCG6 
AACCACCACA 
GTTGCTTTGA 



GTGTTATCAC 
GCCATCCGTA 
TCTGAGAATA 
CGGGATAATA 
AAAACGTTCT 
CCAGTTCGAT 
ACTTTCACCA 
AAAAAAGGGA 
TTTTTCAATA 
TACATATTTG 
ATTTCCCCGA 
CATTAACCTA 
TTCGGTGATG 
CACAGCTTGT 
CGTCAGCGGG 
GAGCAGATTG 
GTTAAAATTC 
AGGCCGAAAT 
GGGTTGAGTG 
GGACTCCAAC 
TACGTGAACC 
GCACTAAATC 
AAAGCCGGCG 
GCGCTAGGGC 
CCCGCC6CGC 
CGTATGCGGT 



74 

TCATGGTTAT 
AGATGCTTTT 
GTGTATGCGG 
CCGCGCCACA 
TCGGGGCGAA 
GTAACCCACT 
GCGTTTCT6G 
ATAAGGGCGA 
TTATTGAAGC 
AATGTATTTA 
AAAGTGCCAC 
TAAAAATAGG 
ACGGTGAAAA 
CTGTAAGCG6 
TGTTGGCGGG 
TACTGAGAGT 
GCGTTAAATT 
CGGCAAAATC 
TTGTTCCAGT 
GTCAAAGGGC 
ATCACCCAAA 
GGAACCCTAA 
AACGTGGCGA 
GCTGGCAAGT 
TTAATGCGCC 
GTGAAATACC 



GGCAGCACTG 
CTGTGACTGG 
CGACCGAGTT 
TAGCAGAACT 
AACTCTCAAG 
CGTGCACCCA 
GTGAGCAAAA 
CACGGAAATG 
ATTTATCAGG 
GAAAAATAAA 
CTGACGTCTA 
CGTATCACGA 
CCTCTGACAC 
ATGCCGGGAG 
TGTCGGGGCT 
GCACCATAAA 
TTTGTTAAAT 
CCTTATAAAT 
TTG6AACAA6 
GAAAAACCGT 
TCAAGTTTTT 
AGGGAGCCCC 
GAAAGGAAGG 
GTAGCGGTCA 
GCTACAGGGC 
GCACAGATGC 



CATAATTCTC 
TGAGTACTCA 
GCTCTTGCCC 
TTAAAAGTGC 
GATCTTACCG 
ACTGATCTTC 
ACAGGAAGGC 
TTGAATACTC 
GTTATTGTCT 
CAAATAGG6G 
AGAAACCATT 
GGCCCTTTCG 
ATGCAGCTCC 
CAGACAAGCC 
GGCTTAACTA 
ATTGTAAACG 
CAGCTCATTT 
CAAAAGAATA 
AGTCCACTAT 
CTATCAGG6C 
TGGGGTCGAG 
CGATTTAGAG 
GAAGAAAGCG 
CGCTGCGCGT 
GCGTACTAT6 
GTAAGGA6AA 



1450 

1500 

1550 

1600 

1650 

1700 

1750 

1800 

1850 

1900 

1950 

2000 

2050 

2100 

2150 

2200 

2250 

2300 

2350 

2400 

2450 

2500 

2550 

2600 

2650 

2700 
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75 



AATACCGCAT 


CAGGCGCCAT 


TCGCCATTCA 


GGCTGCGCAA 


CTGTTGGGAA 


2750 


GGGCGATC66 


TGCGGGCCTC 


TTCGCTATTA 


CGCCAGCTGG 


CGAAAGGGGG 


2800 


AT6T6CT6CA 


A6GCGATTAA 


GTTGGGTAAC 


GCCAGGGTTT 


TCCCA6TCAC 


2850 


GAC6TTGTAA 


AACGACGGCC 


AGTGCCAAGC 


1\.?AAG6TGCA 


CGGCCCAOGT 


2900 


GGCCACTAGT 


ACTTCTCGAG 


CTCTGTACAT 


GTCCGCGGTC 


GCGACGTACG 


2950 


CGTATCGATG 


GCGCCAGCTG 


CAGGCGGCCG 


CCATATGCAT 


CCTAGGCCTA 


3000 


TTAATATTCC 


6GAGTATACG 


TAGCC6GCTA 


ACGTTAACAA 


CCGGTACCTC 


3050 


TAGAACTATA 


GCTAGCCAAT 


TCCATCATCA 


ATAATATACC 


TTATTTTGGA 


3100 


TTGAAGCCAA 


TATGATAATG 


AGGGGGTGGA 


GTTT6TGACG 


TGGCGCGG66 


3150 


CGTGGGAACG 


GGGCGGGTGA 


CGTAGGTTTT 


AGGGCGGAGT 


AACTTGTATG 


3200 


TGTTGG6AAT 


TGTAGTTTTC 


TTAAAATGGG 


AAGTTACGTA 


ACGTGGGAAA 


3250 


ACGGAAGTGA 


CGATTTGAGG 


AAGTTGTGGG 


TTTTTTGGCT 


TTCGTTTCTC 


3300 


GGCGTAGGTT 


CGCGT6CGGT 


TTTCT6GGT6 


TTTTTTGTGG 


ACTTTAACCG 


3350 


TTACGTCATT 


TTTTAGTCCT 


ATATATACTC 


GCTCT6CACT 


TGGCCCTTTT 


3400 


TTACACTGTG 


ACTGATTGAG 


CTGGTGCCGT 


GTCGAGTGGT 


GTTTTTTTAA 


3450 


TAGGTTTTCT 


TTTTTACTGG 


TAAGGCTGAC 


TGTTAGGCTG 


CCGCTGTGAA 


3500 


GCGCTGTATG 


TTGTTCTGGA 


GCGGGAGGGT 


GCTATTTTGC 


CTAGGCA6GA 


3550 


GGGTTTTTCA 


GGTGTTTATG 


TGTTTTTCTC 


TCCTATTAAT 


TTTGTTATAC 


3600 


CTCCTATGGG 


GGCTGTAATG 


TTGTCTCTAC 


6CCTGCGGGT 


ATGTATTCCC 


3650 


CCCAAGCTTG 


CATGCCTGCA 


GGTCGACTCT 


AGAGGATCCG 


AAAAAACCTC 


3700 


CCACACCTCC 


CCCTGAACCT 


GAAACATAAA 


ATGAATGCAA 


TTGTTGTTGT 


3750 


TAACTTGTTT 


ATTGCAGCTT 


ATAATGGTTA 


CAAATAAAGC 


AATAGCATCA 


3800 


CAAATTTCAC 


AAATAAAGCA 


ITlTm'CAC 


T6CATTCTAG 


TTGTGGTTTG 


3850 


TCCAAACTCA 


TCAATGTATC 


TTATCATGTC 


TGGATCCCCC 


TAGCTTGCCA 


3900 


AACCTACAGG 


TGGGGTCTTT 


CATTCCCCCC 


TTTTTCTGGA 


GACTAAATAA 


3950 


AATCTTTTAT 


TTTATCTATG 


GCTCGTACTC 


TATAGGCTTC 


AGCTGGT6AT 


4000 
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ATTGTTGAGT 
AATTGGACTA 
CAACATTTAT 
TCTCCTCTTT 
TTCCGGTGGG 
GAAGAGGCTC 
CTTTGTTCTC 
ATCCTGTGTT 
AGTTCTTCTA 
CATCAAGCAG 
ATCAACTGCT 
AAAGTCAAGC 
CATCTGCAAC 
AAGTTTTTTC 
TCCAAAGGCT 
CATCGATCTG 
AACAAAGTAC 
GCCAGGACTT 
CTGTGTATTT 
CAGATGTCAT 
GAGTTGGCCA 
TTGGCATGTC 
TCCACATCTA 
CATGGCTAAA 
TTAAAATGGA 
TCTATTCTCA 



CAAAACTAGA 
ATCGCGGGAT 
GCTGCTCTCT 
CAGAGCAGCA 
GAAAGAGCTT 
CTCTCGTTCA 
TTCTATGACC 
CACAGAGAAT 
ATTATTTGGT 
CAAGATCTTC 
TGTGGCCATG 
TTCCCAGGAA 
TTTCCATATT 
TAAATGTTCC 
TTCCTCCACT 
GATTTCTCCT 
TCTTCCCTGA 
ATTGAGAAGG 
TGCTGTGAGA 
CTTTCTTCAC 
TTCTTGTATG 
AATGAACTTA 
TGCTGGAGTT 
GTCAGGATAA 
AATGAAGGTA 
TTTGGAACCA 



76 

GCCTGGACCA 
CAGCCAATTC 
AAAGCCTTGT 
ATCTGGGGCT 
CACCCTGTCG 
GCAGTTTCTG 
AAAAATTGTT 
TACTGTGCAA 
ATGTTACTGG 
GCCTTACTGA 
GCTTAGGACA 
ACTGTTCTAT 
TCTTGATCAC 
AGAAAAAATA 
GTTGCAAAGT 
TCAGTGTTCA 
TCCAGTTCTT 
AAATGTTCTC 
TCTTTGACAG 
GTGTGAATTC 
GTTTGGTTGA 
AAGACTCGGC 
TACAGCCCAC 
TACCAACTCT 
ACAGCAATGA 
GCGCAGTGTT 



CTGATATCCT 
CATGAGCAAA 
ATCTTGCATC 
TAGACTTGCA 
GAGGGGCTGA 
GATGGAATCG 
GGCATTCCAG 
TCAGCAAATG 
ATCCAAATGA 
GAACAGATCT 
CAGCCCCCAT 
CACAGATCTG 
TCCACTGTTC 
AATACTTTCT 
TATTGAATCC 
GTAGTCTCAA 
CCCAAGAGGC 
TAATATGGCA 
TCATTTGGCC 
TCAATAATCA 
CTTGGTAGGT 
TCACAGATCG 
TGCA^TGTAC 
TCCTTCTCCT 
AGAAGATGAC 
GACAGGTACA 



GTCTTTAACA 
TGTCCCATGT 
TCTTCTTCT6 
CTTGCTT6AG 
TGGCTTGCCG 
TACTGCCGCA 
CATTGCTTCT 
CTTGTTTTAG 
GCACTGGGTT 
AGCCAAGCAC 
CCACAAGGAC 
AGCCCAACCT 
ATAGGGATCC 
GTGGTATCAC 
CAAGACACAC 
AAAAGCTGAT 
CCACCCTCTG 
TTTCCACCTT 
CCCTGAGGGC 
TAACTTTCGA 
TTACCTTCTG 
CATCAAGCTA 
TCATGATATT 
TCTCCTGTTG 
AAAAATCATT 
AGAACCAGTT 



4050 

4100 

4150 

4200 

4250 

4300 

4350 

4400 

4450 

4500 

4550 

4600 

4650 

4700 

4750 

4800 

4850 

4900 

4950 

5000 

5050 

5100 

5150 

5200 

5250 

5300 
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66CA6TATGT AAATTCA6A6 CTTT6TG6AA 
GCC6TCCGAA GGCACGAAGT GTCCATAGTC 
TGAGTGAAAA TT66ACTCCT GCCTTCAGAT 
TGA6GTTT6G AGGAAATATG CTCTCAACAT 
GCACTGTTGC AACAAAGATG TA6GGTT6TA 
GCTCCAATCA CAATTAATAA CAACTGGATG 
CAGAAGGTCA TCCAAAATTG CTATATCTTT 
CACCT6CTTT CAACGTGTTG AGGGTTGACA 
TGTAACATTT TGTG6TGTAA AATTTTC6AC 
CA6T66TAGA CCTCT6AAGA ATCCCATA6C 
CCACGTAAAT GTAAAACACA TAATAC6AAC 
GCATAGCTGT TATTTCTACT ATGAGTACTA 
AGTGTTTCCA AG6AGCCACA GCACAACCAA 
GAAAAATTAC TAA6CACCAA ATTAGCACAA 
GTAATATATC GAAGGTATGT GTTCCATGTA 
CATATCATCA AAAAAGCACT CCTTTAAGTC 
TTATTTCCAA GCCAGTTTCT TGAGATAACC 
TCAGTCAAGT TTGCCTGAGG GGCCAGTGAC 
TGTCTTTCGG TGAATGTTCT GACCTTGGTT 
TCAGGACA6A CTGCCTCCTT CGTGCCT6AA 
ACGCTGATGC GAGGCAGTAT CGCCTCTCCC 
GGACAGCCTT CTCTCTAAAG GCTCATCAGA 
TTTGTAAGGG AGTCTTTTGC ACAATGGAAA 
GGATTGAGAA TAGAATTCTT CCTTTTTTCC 
AAAA6ATTGT TTTTTTGTTT CTGTCCAGGA 
ATGAGAAACG 6TGTAAGGTC TCAGTTA6GA 



CAGA6TTTCA 


AA6TAA66CT 


5350 


CTTTTAAGCT 


TGTAACAAGA 


5400 


TCCAGTTGTT 


TGAGTTGCT6 


5450 


AATAAAAGCC 


ACTATCACT6 


5500 


AAACTGC6AC AACTGCTATA 


5550 


AAGTCAAATA 


TGGTAAGAGG 


5600 


GGAGAATCTA 


TTAAGAATCC 


5650 


TAGGT6CTTG 


AAGAACAGAA 


5700 


ACTGTGATTA 


GAGTAT6CAC 


5750 


AA6CAAAGT6 


TC6GCTACTC 


5800 


TG6TGCTGGT 


GATAATCACT 


5850 


TTCCCTTTGT 


CTT6AAGAGG 


5900 


AGAAGCAGCC ACCTCT6CCA 


5950 


AAATTAAGCT 


CTT6TCGACA 


6000 


GTCACTGCTG 


GTATGCTCTC 


6050 


TTCTTCGTTA 


ATTTCTTCAC 


6100 


TTCTTGAATA 


TATATCCAGT 


6150 


ACTTTTCGTG 


TGGATGCTGT 


6200 


AACTGAGTGT 


GTCATCAG6T 


6250 


GCGTGGGGCC 


AGTGCTGATC 


6300 


TGCTCAGAAT 


CTGGTACTAA 


6350 


ATCCTCTTCG 


ATGCCATTCA 


6400 


ATTTTCGTAT 


AGAGTTGATT 


6450 


CCAAACTCTC 


CAGTCTGTTT 


6500 


GACAGGAGCA 


TCTCCTTCTA 


6550 


TTGAATTTCT 


TCTTTCTGCA 


6600 
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CTAAATTGGT 
CTGTAGATTT 
OITGCAAAAT 
GTGACCAAAA 
AAATATTTCT 
ATAAATACAA 
GCTCGTTGAC 
GTCTTTCTCT 
TGACGCTTCT 
TCTTTAATGG 
TCTTCCACTG 
TCATCATTAG 
AACAACTGTC 
ACCAAGAAGT 
TTTTTCTATT 
CCCTCCTCCC 
TAAGTTATAT 
TTTTGTTTAT 
AATTGCCGAG 
GAATATTTTC 
ATAAAAACAC 
TATCTCACAT 
GTTTTCAATC 
ATTGGATGTT 
TTCCCAGCTC 
CTGAAAAAGG 



CGAAAGAATC 
TGGAGTTCTG 
TAATATTTTG 
TCCTAGTTTT 
TTTTCTGTTA 
ATCAGCATCT 
CTCCACTCAG 
GCAAACTTG6 
GTATCTATAT 
TGCCAGGCAT 
TGCTTAATTT 
AAGTGAAGTC 
CTCTTTCTAT 
GAGAAATTAC 
GTTATTGTTT 
AGAAGGCTGT 
TCCAATGTCT 
TGCTCCAAGA 
TGACCGCCAT 
CGGAGGATGA 
CACAAAGAAC 
AGGCTGCCTT 
ATTTTTTCCA 
CTCGATCATT 
TCTGATCTCT 
GCAAGGACTA 



78 

ACATCCCATG 
AAAATGTCCC 
TCAGCTTTCT 
GTTAGCCATC 
AAACATCTAG 
TTGTATACTG 
TGTGATTCCA 
AGATGTCCTC 
TCATCATAGG 
AATCCAGGAA 
TACCCTCTGA 
TTGCCTGCTC 
CTTGAAATTA 
TGAAGAAGAG 
TGTTTTGCTT 
TACATTCTCC 
TATATTCTTG 
GAGTCATACC 
GCGCAGAACA 
TTCCTTTGAT 
CCTGAGAAGA 
CCGAGTCAGT 
TTGCTTCTTC 
TCTGAGGTAA 
GTACTTCATC 
TCAGGAAACC 



AGTTTTGAGC 
ATAAAAATAG 
TTAAATGTTC 
AGTTTACAGA 
GTATCCAAAA 
CTCTTGCTAA 
CCTTCTCCAA 
TTCTAGTTGG 
AAACACCAAA 
AACTGAGAAC 
AGGCTCCAGT 
CAGTGGATCC 
ATATCTTTCA 
GCTGTCATCA 
TCTCAAATAA 
ATCACTACTT 
CTTTTGTAAG 
ATGTTTGTAC 
ATGCAGAATG 
TAGTGCATAG 
AGAAGGCTGA 
TTCAGTTCTG 
CCAGCAGTAT 
TCACAAGTCT 
ATCATTCTCC 
AAGTCCACAG 



TAAAGTCTG6 
CTGCTACCTT 
CATTTTAGAA 
CACAGCTTTC 
GGAGAGTCTA 
AGAAATTCTT 
GAACTATATT 
CATGCTTTGA 
GATGATATTT 
AGAATGAAAT 
TCTCCCATAA 
AGCAACCGCC 
GGACAGGA6T 
CCATTAGAAG 
TTCCCCAAAT 
CTGTAGTCGT 
AAATCCTGTA 
AGCCCAGGGA 
AGATGGTGGT 
GGAAGCACAG 
GCTATTGAAG 
TTTGTCTTAA 
GCCTTAACA6 
TTCACTGATC 
CTAGCCCAGC 
AAGGCAGACG 



6650 

6700 

6750 

6800 

6850 

6900 

6950 

7000 

7050 

7100 

7150 

7200 

7250 

7300 

7350 

7400 

7450 

7500 

7550 

7600 

7650 

7700 

7750 

7800 

7850 

7900 
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CCTGTAACAA CTCCCAGATT A6CCCCATGA 
GCGATCCACA CGAAATGTGC CAATGCAAGT 
GTTGTTGGAA AGGAGACTAA CAAGTTGTCC 
CACGGCTTGA CAGCTTTAAA GTCTTCTTAT 
ATTCTCATCT GCATTCCAAT GTGATGAAGG 
GAGCAGTGTC CTCACAATAA AGAGAAGGCA 
TCGCGATAGA GCGTTCCTCC TTGTTATCCG 
CTTCCCAGTA AGAGAGGCTG TACTGCTTTG 
AAAGATTCCA TAGAACATAA ATCTCCA6AA 
TAATGAGTTT AGGATTTTTC TTTGAAGCCA 
TCCAATTTTT CAGATAGATT GTCAGCAGAA 
TATGTCTGAC AATTCCAGGC GCTGTCTGTA 
TG6TCCAGCT GAAAAAAAGT TTGGAGACAA 
GACCTCTGCA TGGTCTCTCG GGCGCTGGGG 
GCTCAAGCTC CTAATGCCAA AGGAATTCCT 
GTTCTAGAGC GGCCGCCACC GCGGTGGCTG 
GCGCTTCGCT TTTTATAGGG CCGCCGCCGC 
AAACTTTCGG AGCGCGCCGC TCTGATTGGC 
TCGCCCCGCC CCGCCCCTCG CCCCGCCCCG 
CCCCCCCCCC CCGCCCCCAT CGCTGCACAA 
ATACAAAATT GGGGGTGGGG AGGGGGGGGA 
ACGTGGCCTC GAGTAGATGT ACTGCCAAGT 
TGTACTGGGC ATAATGCCAG GCGGGCCATT 
6GGGGCGTAC TTGGCATATG ATACACTTGA 
TTACCGTAAA TACTCCACCC ATTGACGTCA 
TTACTATGGG AACATACGTC ATTATTGACG 



GGA6TGCCAC 


TTGCAAAGGA 


7950 


CCTTCATCAA 


ATTTGTTCA6 


8000 


AATACTTATT 


TTATCTAGAA 


8050 


AAATCAAACT 


AAACATAGCT 


8100 


CCAAAAATGG 


CTGGGTGTAG 


8150 


TAAGCCTATG 


CCTA6ATAAA 


8200 


GGTCATAGGA 


AGCTATGATT 


8250 


GTGACTTCCC 


CTAAATATAA 


8300 


AAAACATCGC 


CGAAGGGCAT 


8350 


GCTCTCTATC 


CCATTCTCTT 


8400 


TCAACAGAAG 


GGATTTGGTA 


8450 


TCCTTTCCTC 


AAAATTGGTC 


8500 


CGCTGGCCTT 


TTCCAGAGGC 


8550 


TCCCTGCTAG 


GGCCGTCTGG 


8600 


GCAGCCCGGG 


GGATCCACTA 


8650 


ATCCCGCTCC 


CGCCCGCCGC 


8700 


CGCCGCCTCG 


CCATAAAAG6 


8750 


TGCCGCCGCA 


CCTCTCCGCC 


8800 


CCCCGCCTGG 


CGCGCGCCCC 


8850 


AATAATTAAA 


AAATAAATAA 


8900 


GATGGGGAGA 


GTGAAGCAGA 


8950 


AGGAAAGTCC 


CATAAGGTCA 


9000 


TACCGTCATT 


GACGTCAATA 


9050 


TGTACTGCCA 


AGTGGGCAGT 


9100 


ATGGAAAGTC 


CCTATTGGCG 


9150 


TCAATGGGCG 


GGGGTCGTTG 


9200 
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GGCGGTCA6C CAGGCGGGCC ATTTACCGTA AGTTATGTAA CGACCTGCAG 
CCTGATCTCC CTAGACAAAT ATTACGCGCT ATGAGTAACA CAAAATTATT 
CAGATTTCAC TTCCTCTTAT TCAGTTTTCC OGCGAAAATG GCCAAATCTT 
ACTCGGTTAC GCCCAAATTT ACTACAACAT CCOCCTAAAA CCGCGCGAAA 
A'WGTCACTT CCTGTGTACA CCGGC6CACA CCAAAAACGT CACTTTTGCC 
ACATCCGTCG CTTACATGTG TTCCGCCACA CTTGCAACAT CACACTTCCG 
CCACACTACT ACGTCACCCG CCCCGTTCCC ACGCCCCGCG CCACGTCACA 
AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA TAAGGTATAT 
TATTGATGAT GCTA6CATGC GCAAATTTAA AGCGCTGATA TCGATCGCGC 
GCAGATCTGT CATGATGATC ATTGCAATT6 GATCCATATA TAGGGCCCGG 
GTTATAATTA CCTCAGGTCG ACGTCCCATG GCCATTCGAA TTCGTAATCA 
TGGTCATAGC TGTTTCCTGT GTGAAAOTGT TATCCGCTCA CAATTCCACA 
CAACATACGA GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG 
TGAGCTAACT CACATTAATT 6CGTTGCGCT CACT6CCCGC TTTCCAGTCG 
GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG 
. AGGCGGTTTG CGTATTGGGC GC 

(2) INFORMATION FOR SEQ ID NO: 4 I 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
TAGTAAATTT GGGC 



9250 

9300 

9350 

9400 

9450 

9500 

9550 

9600 

9650 

9700 

9750 

9800 

9850 

9900 

9950 

9972 



14 
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(2) INFORMATION FOR 3EQ ID NO:S: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(O) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AGTAA6ATTT 6GCC 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
A6T6AAATCT 6AAT 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GAATAATTTT GTGT 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYFE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 8: 
C6TAATATTT GTCT 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
WANWTTT6 



14 



(2) INFORMATION FOR SEQ ID NO: 10: 

(L) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unkno%m 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCAATTCCAT CATCAATAAT ATACCTTATT TTGGATTGAA GCCAATATSA 
TAATGAGGGG GTGGAGTTTG TGACGTGGCG CGGGGCGTGG GAACGGGGCG 
GGTGACGTAG GTTTTAGGGC GGAGTAACTT GTATCTGTTG GGAATTGTAG 
TTTTCTTAAA ATGGGAAGTT ACGTAACGTG GGAAAACGGA AGTGACGATT 
TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT TTCTGGGCGT AGGTTCGCGT 
GCGGTTTTCT GGGTGTTTTT TGTGGACTTT AACCGTTACG TCAinTTTA 
GTCCTATATA TACTC6CTCT GCACTTGGCC CTTPTTrACA CTGTGACTGA 
TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT TTTAATAGGT TTTCTTTTTT 



50 
100 
150 
200 
250 
300 
350 
400 
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ACTGGTAAGG 


CTGACTGTTA 


GGCTGCCGCT 


GTGAAGCGCT 


GTATGTT6TT 


450 


CTGGA6CGG6 


AGGGTGCTAT 


TTTGCCTAG6 


CAGGAGGCrr 


TTTCAGGTGT 


500 


TTAT6T6TTT 


TTCTCTCCTA 


TTAATTTTGT 


TATACCTCCT 


ATGGG6GCTG 


550 


TAATGTT6TC 


TCTACGCCTG 


CGGGTATGTA 


T^CCCCCCAA 


GCTT6CAT6C 


600 


CTGCA66TC6 


ACTCTAGAGG 


ATCCGAAAAA 


ACCTCCCACA 


CCTCCCCCTC 


650 


AACCTGAAAC 


ATAAAATGAA 


TGCAATTGTT 


GTTGTTAACT 


TGTTTATTGC 


700 


A6CTTATAAT 


GGTTACAAAT 


AAAGCAATAG 


CATCACAAAT 


TTCACAAATA 


750 


AAGCATTTTT 


TTCACTGCAT 


TCTAGTTGTC 


GTTTGTCCAA 


ACTCATCAAT 


800 


6TATCTTATC 


ATGTCTGGAT 


CCCCGCGGCC 


GCTCTAGAAC 


TA6T66ATCC 


850 


CCC66GCTGC 


AGGAATTCC6 


TAACATAACT 


GCGTGCTTTA 


TTGAGATACA 


900 


CAGTAAAGCA 


GTAATATAAT 


ACAATAGTAA 


GGCATATATT 


TGGTGAAATC 


950 


TGATATGTTG 


TGAAAATGCA 


GTAAAACTGA 


A6TTTAAAAA 


AATAATTAGT 


1000 


AAAT6TTACA 


GTGTT6GT6T 


TAAAACACAA 


TCTATTATGA 


TACTCAA6TA 


1050 


AGA6TCCAGT 


ACCT6GAGAC 


AATGATGATA 


CATGCCAT6T 


GATGATTATG 


1100 


CTTCAGTTAC 


ACTGATTATG 


Atttacactt 


TAATACTT6A 


TGGTTATAAA 


1150 


GAACATGAAA 


TGATGTCCAA 


ATTATGCTTA 


AAATCAGCAA 


TAAAGCTCTC 


1200 


AGTTTTTATT 


CAAATATTTT 


GATAGATTCA 


CTCCAGAACT 


AATATCTAAA 


1250 


AGATAAAAC6 


AAAAGATTAA 


AACAAAACTA 


TGCACTCTAT 


CTACCTT6GA 


1300 


TTTTA6AATG 


AAACTTAAAA 


CTTCTTAGTA 


GGAAAGGAAC 


CCCTTGTTTT 


1350 


AAATCTTGGT 


GAAAACAAAT 


CCTTGGATAA 


AGAAAATGCC 


CAGTGCCACA 


1400 


TAAAGGAGAG 


AGAGAGAGAA 


AAGCAAGACC 


AGAACCAAAT 


TTCAATTTGT 


1450 


TATCTTAGAG 


CTTTGGGTTT 


TCTTTTGGAA 


ATTATAAATG 


AAAAAAGGAA 


1500 


ACTGGT6TCC 


ACACAACAGA 


CAAGTGGTGA 


A6TTGTGAAA TTAGGTGT6C 


1550 


ACAATTACTA 


GAAACACCCC 


AAAACCAAA6 


TGAGGTAGAA ATAGCAT6A6 


1600 


AAGCTGTGTT 


TGATGTTAAT 


TACAATTAAT 


AATGGACAAA ACCCACTCGC 


1650 


TAGAAGTTAA 


TTACACTTGA 


CGTTAGAGGT 


AACAGATTTG 


CAAAATGATA 


1700 
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GGACAGTGAT 
ACTGGCATGA 
GCACTTTTTC 
CGTCAACTAG 
AACCACTGTG 
AACCAATGCA 
CACCAGGGTC 
CTTGCAGCAT 
AAAATCTAAA 
CAACAACAAA 
ATTTAGTTGT 
CCATTCTTTT 
CGCAACCTTT 
TGTCATCTGA 
AGGAAGTTGA 
TATTCATTCG 
AAAGTTTAAA 
CGGTAGTCTC 
AGATAAGCCC 
ATTTGAAGAA 
TATACTTGTA 
ACTCAGTCAA 
GTTCGTATGC 
CGCTAAAGTA 
TGTGTGTGTG 
GTGTGTGTGT 



TTCTATTGAG 
GAGGAGTAAA 
TCCTGGTTCA 
ACCAGAGAGT 
CCTTCTCACC 
AAGGAGACAA 
ACAATGTGAT 
TAAAAAAAAA 
ATCTAAAATT 
ACCACCCACT 
AATTATAGAG 
ACAGAGTTGT 
AGCTCCTACC 
TGTTCTATTG 
ATATATCAGC 
CATGTTCCTT 
ACTAGAAACA 
CTGGCTTT6G 
AGATGACTAG 
6TAACTTCAT 
GACATATAGA 
CAATTCTCAA 
AAGAAAATAA 
ATGCAAAACA 
GTGGGTTCGT 
GTGTGTGTGC 



84 

AGAATGCTCT 
GCTCTTCCTA 
ATGACTTGCA 
TTGGAGACGC 
CACAATCCTG 
ATGCAGTTCA 
TGGGTTACTT 
AATCATCACA 
CATCATCATC 
TCAGGTTGAG 
ATGTTTATAT 
TGCTCCCCTC 
ATCTTCCTCC 
CAGAAACATC 
CAACAAATTA 
GAAAAAAATG 
TCTGGAGCCC 
GCTCCAGGGA 
AAGCAATTTC 
ATCTATTTAT 
TGTATAAAAT 
AAGAGCAATA 
AAAAACGTCA 
ATGTGCTGCC 
GCATGTATGT 
GTGTGTGTTT 



TTAAATGCTA 
GCAGTCCTTA 
TTTGTTTAGA 
TirTGCTCTC 
TGTGGAGTTA 
TGGGCTTCTG 
TCTTAACAGT 
ATGAAGAAAA 
ATCAACAACA 
TTTATGAAGA 
GTATAGTTGT 
ATATAAATTG 
TACTGTCTGG 
ATTAAATATA 
CTTATGATAGT 
AATCCTCTAG 
TAGACAATAT 
AAATTCACTC 
CATTAGGAAG 
CTATATACCT 
GAAAGCCCAT 
TGAAGCAGTC 
TGAATTCCAT 
TCAGTGTGTG 
GTGCGTGTGT 
GTTTAGGGGT 



AGAAGAAGAA 
GCTTTCTGTT 
CATTTCAGCC 
AAAACTTTCC 
CTTGCAGGGA 
GACTGATATT 
AATCCTAAGT 
AAAAACCCAA 
ACAACAACAA 
GGGCAGAACA 
AAATATTCAT 
ACTGAGGAGC 
GAGTTAAAAA 
ACCCAACA6T 
AAGTCCTGT6 
CTCTCAGTGG 
TTTAGTGTGG 
TTGCCCAAGC 
TGGCAAGAAC 
ATAGTATTTA 
AGCCAGCCCC 
ATTTGGTGG6 
ATGAATACCA 
TGTGTGTGTG 
GTGTGTGTGT 
TTTTATAAAC 



1750 

1800 

1850 

1900 

1950 

2000 

2050 

2100 

2150 

2200 

2250 

2300 

2350 

2400 

2450 

2500 

2550 

2600 

2650 

2700 

2750 

2800 

2850 

2900 

2950 

3000 
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AACTTTTTTT ATAAAGCACA CTTTAGTTTA 
TATAAATTTT TAAACAACCC AAAATGCGTT 
TTATTTAGCT ATCAAGATTT TACATGTTTT 
TTGCATA6AC GTGTAAAACC T6CGATTGTT 
AGAAACTACT GAAATCTACA GTATAGTACC 
AGATTTTATT TCTTGTAAAC TCTTACTGTC 
ATATTATAAA AACCATGCG6 GAATCAGGAG 
TCCTTCTTCA TCTGTCATGA CTGAAACTAA 
AATCATCTGC CATGTGGAAA A6GCTTCCTA 
GCTTTCCGGG GGCATTTCTT CCTCTTGAAC 
TGCTCCATCA CTTCTTCTAA CCCTGTGCTT 
AAGATCTTCC TCACCCATAG ATTCTGAA6T 
GCAGCATAGG CTGACTGCTA TCTGACCTCT 
GACACCGTGG TGCCATTCAC CTTA6CTTCA 
CTGTCTCAGT CTATGTAACT GAGACTCCAG 
GGATTTGCAT CCTGGCTTCC AGGCGTCCTT 
GCCTCAGCAA TGAGCTCAGC ATCCCTGGGA 
CATCTCAGGA GGAGATGGCA GTGGAGACAG 
GCTTCAGGC6 ATCATATTCT GCTTGCAGAT 
TCTGCTAGGA TTCTCTCTAG CTCCCCTCTT 
CAAGATCTGG GCAGGACTAC GAGGCTGGCT 
AACTTTGGCA GTAATGCTGG ATTAACAAAT 
TTAGGAGAGA TGCTATCATT TAGATAAGAT 
TGCTAGCCTG CTAGCATAAT GTTCAATGCG 
AAAGCTGGGG GGACGAGGCA GGCGCAGAAT 
AGAGTAACGG GAGTTTCCAT GTTGTCCCCC 



CAATCTCTCT 


TTATAACTGT 


3050 


CCATATAAA6 


AAATGGCAA6 


3100 


CTTTTAACTT 


TTTTGTACAA 


3150 




TAACAGACTT 


3200 


ACTACCCTTC 


ACAAAAATAT 


3250 


TAATCCTCTT 


TGTTGTACGA 


3300 


TT6TAAAACA 


TTTATTCTGC 


3350 


GGACTCCATC 


GCTCTGCCCA 


3400 


CATT6TGTCC 


TCTCTCATT6 


3450 


TA66GAAGGA 


6TT6TT6A6T 


3500 


GT6TCCT6GG 


GAGGACTCAG 


3550 


TTGACT6CCA 


ACCACTCGGA 


3600 


GCA6AGAGGT 


GGAAG6A6A6 


3650 


6CCT6G66CT 


6CTCCA66A6 


3700 


CTGTTTATT6 


T6GTCTTCCA 


3750 


TGTGTTGGCG 


CAGTA6CTTA 


3800 


CTCTGAGGAG 


AGGTGG6CAT 


3850 


GCCTTTATGC 


TCAT6CTGCT 


3900 


TCCTGTTTTC 


TTCCTCAAGA 


3950 


TCCTCACTCT 


dlAAGGAAAT 


4000 


CAGGGGGGAG 


TCCTGGTTCA 


4050 


GTTCATCATC 


TAT6CTCTCA 


4100 


CCATTGCTGT 


TTTCCATTTC 


4150 


T6AATGA6TA 


TCATCGTGTG 


4200 


CTACTGGCCA 


GAAGTT6ATC 


4250 


TCTAACACAG 


TCTGCACTGG 


4300 
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CAGGTAGCCC 
TGTTTTTTAG 
GGAGTGCAAT 
TCGGCCAGAA 
GGCTTCTGTA 
TTGGCTTGAT 
CAGCCACACC 
CAGCTTCAAT 
CTCGGCTCAA 
TGGGATTTGA 
AAAAGCCAGT 
TCCAAGTGTG 
ACGGATCCTC 
GACACATATC 
TCCAGACGAT 
CATGGGCTGG 
CAGCTGACAG 
AGCTTCATGG 
TAAAGACTGG 
GGGTCTCGTG 
TCCCAGGGAC 
AAAGTCCCTG 
CCTGTAGAAG 
TAAGGTGAGA 
GACACGATTG 
TGACTTTTTC 



ATTCGGGGAT 
TACCTTGGCG 
ACTCTACCAT 
AAAAAGCAAC 
CCTGAATCCA 
GCTTGGCAGT 
ATAGACTGGG 
CTCAGGTTTA 
TGTTACTGCC 
ATAGAATCAT 
TGAACTTGCC 
CTTTACACAG 
CCTGTTCGTC 
CACACAGAGA 
CATAAATTGT 
TCATTTTGCT 
GCTCAAGAGA 
CAGTCCTATA 
TAGAGCTCTG 
GTTGATATAG 
CCTGAACTGA 
TGGGCTTCAT 
CCTCCATCTG 
GCTGAATGCC 
ACATTCTCTT 
AAGGTGATCT 



GCTTCGCAAA ATACCTTTTG 
AAGTCGCGAA CATCTTCTCC 
GGGGTAGTGC ATTTTATGGC 
TTTGGCAGAT GTuATAATTA 
ATGATTGGAC ACTCCTTACA 
TTCAGCAGCA GCCACTCTGT 
GTTCCAGGCG CATCCAGTCA 
TTATTGGCAA ATTGGAAGCA 
CCCAAAGGAA GCAACTTCAC 
GCAGAAGAAG ACCCAGCCTA 
ACTTGCTTGA AAAGGTATCT 
AGAAATGATG CCAGTTTTAA 
CCGTATCATA AACATTGAGA 
QGGACATTGA CCAGATTGTT 
AGTCAAACAG TTAATTATCT 
TGAGGTTGTG CTGGTCCAGG 
TCCAAGCAAA GGGCCTTCTG 
CGCGGAGAAC CTGACATTAT 
TCATTTTGGG GTGGTCCCAA 
TAGGGCACTT TGTTTGGTGA 
AGTGGAAAGG AAGTGCTGGG 
GCAGCTGTCT GACACGGTCC 
GTATTCAGAT CTTCCAAAGT 
CAGTGTGGTC AGCTGATGTG 
TAAGAGGTGC AATTTCTCCC 
TGCAGAGAGT CAATGAGGAG 



GTTCGAAATT 
GGATGTAGTC 
CCTTTGCAAC 
AAATGCTTTA 
GATGTTACAC 
GCAAGACGGG 
AGGAAGAGAG 
GCTCCT6ACA 
CCAACTGTCT 
CGCTGGTCAC 
GTACTTGTCT 
AAGACAGGAC 
AGCCAGTTGA 
GTGCTCTT6C 
GCAGGATATC 
GCATCACATG 
GAGCCTTCTG 
TCAGGTCAGC 
CAAGTGGTTT 
GATGGCTCTC 
ATGCAGGACC 
TCCACAGCCA 
GCTGAGGTTA 
CAAGGTCATT 
CGAAGTGCCT 
ATCCCCCACT 



4350 

4400 

4450 

4500 

4550 

4600 

4650 

4700 

4750 

4800 

4850 

4900 

4950 

5000 

5050 

5100 

5150 

5200 

5250 

5300 

5350 

5400 

5450 

5500 

5550 

5600 
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GGCTGCCAGG ATCCCTTGAT CACCTCTAGCT 
TTCATCGGCA GCTTCCT6AA GTTCCTGGAG 
TTTTTCTCT6 CCAATCA6CT GAGC6CAGGT 
TTGACCTCTT CAGCCTGCTT TCGTAGGAGC 
TTCTTCAGGA (SGCAGTTCTC TGGGCTCCTG 
CCAAAGGCTG CTCTGTCAGA AATATTCTCA 
ATTACAGGTT CTTTAGTTTT CAATTCCCTC 
ATTCTGCTTC TGAACTGCTG GGAAATCACC 
TCAGTTCATC ATCTTTCAGC TGTAGCCAAA 
AGATGCAAAC GCTTCCACTG GTCAGAACTT 
GTTGAGAGAC TTTTTCTGAA GTTCACTCCA 
AACGTCTTT6 TAACAGGGGT GCTTCATCCG 
ATTTTTTGGC CATTTTCATC AAGATTGTGA 
AATTTCTCCT TGGAGATCTT GCCATGGTTT 
TGGAGTCTTC TAGGAGCTTC TCCTTACGGG 
GCAGTTGTTT CTGCTTCCGT AATCCAGGAA 
AGGGAACTGC TGCAGTAATC TATGAGTTTC 
CACTTACTCT TTTATGAAT6 TTTCCCCAAG 
ATCATGTGTA CTTTTCTGGT ATCATCAGCA 
CAGTGCCAAA TCATTTGCCA CGTCTACACT 
CTTTGGCCAA CTGCTTGGTT TCTGTGATCT 
GTGTGAGGAC CTTCTTTCCA TGAGTCAAGC 
GACCTGTTCG GCTTCTTCCT TAGCTTCCAG 
ACAOrrTCATT CAACTGTTGT CTCCTGTTCT 
TCCCACTGAA TCTGAATTCT TTCAATTCGA 
TTCTTGATTG CTGGTTTTGT TTTTCAAATT 



TGGCGCAACT 


TGAGGTCCAG 


5650 


TCTTTCAAGA 


GCTTCATCTA 


5700 


TCAATTT6TC 


CCATTCAGC6 


5750 


C(.A6TGACAT 


TCTGAGCTCT 


5800 


GTAGA6TTTC 


TCTAGTCCTT 


5850 


CAGTCTCCAG 


AGTACTCATG 


5900 


TTGAAGGCCC 


TATGTATATC 


5950 


ACCGATGGGT 


GCCTGAC6GC 


6000 


CAAGAAGTTC 


CTGAAGAGAA 


6050 


6CTTCCAAAT 


GGGACCTAAT 


6100 


CTTGAAATTC 


ATGTTATCCA 


6150 


AACCTTCCA6 


GGATCTCAGG 


6200 


TA6ATATCTG 


TGTGAGTTTC 


6250 


CATCAGCTCT 


CTGACTCCCC 


6300 


AAGCGTCCTG 


TAGGACATTG 


6350 


AGAAACTTCT 


CCAGGTCCAG 


6400 


TTCCAAAGCA 


GCCTCTTGCT 


6450 


AAGTATTGAT 


ATTCTCTGTT 


6500 


GAATAGTCCC 


GAAGAAGTTT 


6550 


TATCTGCCGT 


TGACGGAGGT 


6600 


TCTTTTGGAT 


TGCATCTACT 


6650 


TTGCCTCTGA 


CCTGTCCTAT 


6700 


CCATTGTGTT 


GAATCCTTTA 


6750 


GCAGCTGTTC 


TTGAACCTCA 


6800 


TCAGTAAT6A 


TTGTTCTAGC 


6850 


CTGGGCAGCA 


GTAATGAGTT 


6900 
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CTTCCAATTG 
TTGATGATCA 
TGATTTTATA 
AAGCTCGGTT 
GGCATTTCTA 
CACTAGAGTA 
GGGCACGGTC 
GCCTCCCACT 
GCTTGGTTTT 
CATCCGCTTG 
GGTCCTGCCT 
AGAGACCCAC 
CTTTTAAGTG 
ACCTTTATCC 
TTGCTCTTCT 
CATTTAATTG 
AACTTGACTT 
AGTAATAGCA 
AATCTCTTTG 
CTCCTTTCTG 
TCCTAATTTT 
TTATTTCTTC 
CCAATGCCAT 
CCAGTTTTCA 
GCCATTGATT 
ACTGATCTGT 



GGGGCGTCTC 
TTTCATTGAT 
ACTCGATCAA 
GAAGTCTGCC 
GTTTGGAGAT 
ACAGTCTGAC 
AGGCTGCTTT 
CAGACCTCAG 
TCCTTATACA 
TTTACCGTGA 
GACTTGGTTG 
AGAAGCAGGT 
AACCTCAAGC 
ACTGGAGATT 
GGCCTTATGG 
TTTTAGAATT 
GTTCAAGTTG 
ATGTTATCTG 
AAATTCTGAC 
CCAGCTCTTT 
TCTTGTAGAA 
CCCAGTTGCA 
CCTGGAGTTC 
GGATTTTGTG 
AAATACCTTC 
CGAATCGCCC 



88 

TGTTCCAAAT 
GTCTTCCAGA 
GCAGAGACAG 
AGTGCAGGTA 
GACAGTTTCC 
TGGCAGAGGC 
GTCCtCAGCT 
ATCTTCTAAC 
AATGCTGCCC 
ACTGTTACTT 
GTTATAAATT 
GATCCAGCTG 
TCTCCTTGTT 
TGTCT6TTTG 
GAGCACTTAC 
CCCTGGCGCA 
TTCTTTTAGC 
CTTCTTCCAG 
AAGACATTCT 
GCAGATGTCG 
TATTGACATC 
TTCAGTGTTC 
CTTAAGATAC 
TCTTTTTGAA 
ATATCATAAT 
TTGTCGTTCC 



CTTGCAGTGT 
TCACCCACCA 
CCAGTCTGTA 
CCJCCAACAG 
TTAGTAACCA 
TCCAGTAGTG 
CCCGAAGTAA 
TTCCTCTTCA 
TTTCGACAAA 
CAATCTCCTT 
TCCAACTGGT 
CTCTTCAAGC 
TCTCAGGTAA 
AGCTTCTTTT 
AAGTACTGCT 
GGGGCAACTC 
TGCTGCTCAT 
CCACAAAACA 
TTTGTTCTTC 
TGCCACCGCA 
TGTTTTTGAA 
TGACAACA6C 
CATTTGTATT 
AAACTGTTCA 
GAAAGTGTCG 
TTGTACATTC 



TGCCTTCT6T 
TCACTCTCTG 
AGTTCTGTCC 
CAAAGAAGAT 
CAGATTGT6T 
CTCAGTCCAG 
ATGGTTTACA 
CTGGCTGAGT 
AGCCTTTCCA 
TATGTCAAAC 
TTCTAATAGG 
TGCCTAAAAT 
AGCTCTGGAG 
CAAGTTTATC 
CCTCCTGTTT 
TTCTGCCA6T 
CTCCAAGTGG 
AATTCATTTA 
AATCCTCTTT 
GACTCAAGCT 
GACTGTT6AA 
TTGACGCTGC 
TAGCATGTTC 
ACTTCATTCA 
CCATTTTTCA 
TATGAAGTTT 



6950 

7000 

7050 

7100 

7150 

7200 

7250 

7300 

7350 

7400 

7450 

7500 

7550 

7600 

7650 

7700 

7750 

7800 

7850 

7900 

7950 

8000 

8050 

8100 

8150 

8200 
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TTCCCCCTGG AAATCCATCT GTGCCACGGC 
CCATGGAGGT GGCACTTTGC AAGGCTGCT6 
TCAATCCGAC CTGAGATTTG TTGCAAATT6 
CTCCTCTT6C TTAAAAAGAT CTTCAAAATC 
TATTTAGAAG ATGATCAACT TCTGAAAGAG 
TCGGTCAAAT AAGTAGAAGG CACATAAGAA 
AGTCGTCACT ACCATAGTTT CTTCATGGA6 
TGAGTCTTCG AAACTGAGCA AAATTGCTCT 
CTGAGCTGGA TCTGAGTTGG CTCCACTGCC 
CAAGCCCTCA GCTTGCCTGC GCACTGCATT 
GCAATTCACG ATCAATTTCC TTTAATTTTC 
AGGCTGGCTA ATTTTTTTTC AATTTCATCC 
AGCCTGCCTC TTGTACTGAT ACCACTGGTG 
TTCTTCTTTG AGACCTCAAA TCCTTGAGAG 
AGCTGCTGTT TTATCTTTAT TTCCTCTCGC 
TTGTTGTAAG TTGTCTCCTC TTTGCAACAA 
TGTCTTCACT CATATCTTTA TTGAAGTCTT 
T6CTGAATTT CA6CCTCCAG TGGTTCAA6C 
AAACTGCTCC AATTCCTTCA AAGGAATGGA 
TGTGAGAAAT AGCTGCAAAT CGACGGTTGA 
ACTACTTTCC TGCAGTGGTC ACCGCGGTTT 
GTCACGTGTG GAGTCCACCT TTGGGCGCAT 
AACGCTTAAG AATGTCTTCC TTTTGTTGTG 
TCTAAAAGTT CATCTGCATG AATGATCCAC 
CTGATCAAAG GTTTCCATGT GTTTCTGGTA 
ATTCTTCTAC TCTGGAGGTG ACAGCTATCC 



TTCCTGTACT TTCACCTTTT 


8250 


TCTTCTTCTT GTGAATAATA 


8300 


TCTTTTATAT TCTTAA6AGA 


8350 


T*TPAGCACAG AGTTCAGGAG 


8400 


CTTGTAAGAT ATGACTGATC 


8450 


ACATCCAAAG GCATATCTTC 


8500 


A6TGTGAATT TGTGCAAAGT 


8550 


CAATTTGCC6 CCAGCGCTT6 


8600 


ATTGCGGCCC CATTCTCAGA 


8650 


CAGCTCCTCT TTCTTCTTCT 


8700 


TTTCATCTCT GGGTTCAG6T 


8750 


AA6CATTTCA GGAGATCATC 


8800 


AGAAATTTCT AGGGCCTTTT 


8850 


CATTATGTTT TGTCTGTAAC 


8900 


TTTCTCTCAT CTGTGATTCT 


8950 


TTCATTTACA GTACCCTCAT 


9000 


CCTCTTTCA6 ATTCACCCCC 


9050 


AATTTTTGTA TATCTGAGTT 


9100 


GGCCTTTCCA GTCTTAATTC 


9150 


GCTCAGAGAT TTGGGGCTCT 


9200 


QCCATCAATT TTGCTGCTTG 


9250 


GTCATTCATT TCAGCCTTTA 


9300 


GTTTCTTCTT TTCAGACTCA 


9350 


TTTGTGATTT GTTCTATGTT 


9400 


TTCCAACAAA AGATTTAGCC 


9450 


AGTTACTGTT CAGAAGACTC 


9500 
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90 

9550 



9850 
9900 
9950 



AGTTTATCTT CTACCAAG6T TTCTTTCTTG CCCAACACCA TTTTCAAAGA 
CTCTCCTAAT TCTGTAACAC TCTTCAAGTG AGCCTTCTGT TTCTCAATCT 9600 
CTTTTTGAGT AGCCTTTCCC CAGGCAACTT CAGAATCCAA ATTACTTGGC 9650 
ATTCCTTCAA CTGCTGATCT CTTCGTCAAT TCx5TATCTG TTGCTGCCAG 9700 
CCATPCTGTT AAGACATTCA TTTCCTTTCT CATCTTACGG GACAACTTCA 9750 
AGCAWTCTC CAACTGTTGC TTTCTCTCTG TTACCTTCGC ACCCAACTCA 9800 
TTGTAAT6CA ATTTCAAA6C TGTTACTCGT TCATCAAGCT CTTTGGGATT 
TTCTGTCTGC TTTTTCTGTA CAATTTGACG TCCGGTTTTA ATCACCATTT 
CCACTTCAGA CTOGACTTCA CTCAGGCTTT TATACAA6TT CACACAATGA 
CTTAGTTGTG ACI^AATTAC ITCCTGTTCA ACACTCTTGG TTTCCAATGC 10000 
AGGCAAATGC ATCTTGACTT CATCTAAAAT CATCTTACTT TCCTCTAGAC 10050 
GTTGTTCAAA ATTGGCTGGT TTTTGGAATA ATCGAAATTT CATGGAGACA 10100 
TCTTGTAATT TTTTCTGTGC AACATCAATT TGTGAAAGAA CCCTTTGGTT 10150 
GGCATCCTTC CCCTGGITAT GTTTCT1X:AT TTCTTCTAAA CTTATCTCAT 10200 
GACTTGTCAA ATCTGATTGG ATTTTCTGGG CTTCCTGAGG CATTTGAGCT 10250 
GCATCCACCT TGTCAGTCAT ATAAGCTGCC AACTGCTTGT CAATGAATTC 10300 
AAGCGACTCC TCAATTAAGT GCAAGGACTT TTCAATTTCC TGGGCAGACT 10350 
GGATACTCTG TTCAAGCAAC TITPGTTTCC TCACAGCCTC TTCATGTAGT 10400 
TCCCTCCAAC GAGAATTAAA CGTCTCAAGC TCCTCATTGA TCAGTTCATC 10450 
CATGACTCCT CCATCTGTAA GAGTCWTGC CAATAGACGA ATCWATTTG 10500 
GGTTCTCCTC TGAATGATGC ATCAGATTIT CAAGAGATTC TAGCACTTCA 10550 
GTGATTTCCT CAGGTCCTGC AGGAACATTT TCCATGGTTT TAAGTTTCAA 10600 
TTCTACTTCA TTGAGCCACT TCTTTGCTTT CTCTAAATAT GACAATAACT 10650 
CA1H5CCAACA TGCCCAAACT TCTTCCAAAG TTTTGCATTT TCCATTCAGC 10700 
CTGGTGCACA GCCATTGGTA GrPGGTGGTC AGAGTTTCAA GTTCCnTTT 10750 
TAAGGCCTCT TGTGCTGAGG GTGGAGCGTG AGCTATTACA CTATTTACAG 10800 
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TCTCAGTAA6 GAGTTTCACT TTAGTTTCTT 
GCTCTCTTCA TTTCTTCAAC AGCAGTCTGT 
TTCAAAATCT CTCTCTA6AT A17CTTCTTC 
GCATCTCTGA TAGATCTTTT TGGAGGCTTA 
TTTAAGGCTT CCTTTCTGGT GTAGACCTGG 
AGTGTTAAGC TCTCTAAGTT CTGTCTCCAG 
CAGCTTCACT CTTTATCTTC TGCCCACCTT 
GGCTGAATT6 TTTGAATATC ACCAACTAAA 
TTTTTTCAGG ATTTCAGCAT CCCCCAGGGC 
AAACATCAAC TTCAGCCATC CATTTCTGTA 
AATTTTCGAA GTTTATTCAT ATGTTCTTCT 
CAACTGGGA6 GAAAGTTTCT TCCAGTGCCC 
ACAGATATTT CTGGCATATT TCTGAAGGTG 
ACAGTGTCAC TCAGATA6TT GAA6CCATTT 
TTGCAGAGCC TGTAATTTCC CGAGTCTCTC 
TAACACTAAG ATAAGGTACA GAGAGTTTGC 
GTCCTGATGC TACTCATTGT CTCCTGATAG 
AAAAATTGTC TGTAGCTCTT TCTCTTTGGC 
GGTTAAAATG ATTAGTAAAG GCCACAAAGT 
CCCTGTCCCT TTTCTTTCAG TTGTAGACTC 
TTGAGGCTGA AGAGCTGACA ATCTGTTGAC 
ACTGGCTTTt AATTGCTGTT GGCTCTGATA 
AACAAGTTTT CGGCAGTAGT TGTCATCTGT 
ATAAAAGGTA ATGATGTTGG TTTGATACTC 
TCAGCAATTG GCAGAATTCT GTCCACCGGC 
TGTCTGATAC TTTCAGCATT AACACCCTCA 



TTTGTAGTGC 


CTCTTCTTTA 


10850 


AATTCATCTG 


GA6TTTTATA 


10900 


AGCTTGT6TC 


ATCCACTGAT 


10950 


CGwTTTTATC 


CAAACCT6CC 


11000 


CGGCATATGT 


GATCCCACTG 


11050 


TCTGGATGCA 


AACTCAAGTT 


11100 


CATTAACACT 


ATTTAAACT6 


11150 


AGTCTGCATT 


GTTTGAGCTG 


11200 


AGGCCATTCC 


TCTTTCA6GA 


11250 


AGGTTTTTAT 


GTGATTCTGA 


11300 


AGCTTTTGGC 


AGCTTTCCAC 


11350 


CTCAATCTCT 


TCAAATTCTG 


11400 


CTTTCTTG6C 


CATCTCCTTC 


11450 


TGTTGCTCTT 


TCAAA6AACT 


11500 


CTCCATTATT 


TCATATTCAG 


11550 


TTTCTGACTG 


CTGGATCCAC 


11600 


CGCATT6GTG 


GTAAAGTGTC 


11650 


CCTCACACCA 


TCAAAGATGT 


11700 


CTGCATCCAG 


AAACATT6GC 


11750 


TGAATTTTTA 


ATTGCTCAAT 


11800 


T!TCATCCTTA 


CAAATTTTTA 


11850 


GGGTGGTAGA 


CTGGGTTTTC 


11900 


TCCAATTGTT 


GTAGCT6ATT 


11950 


TAGCCAGTTA 


ACTCTCTCAC 


12000 


TGTTCAGTTG 


TTCTGAAGCT 


12050 


TTTGCCATCT 


GTTCCACCAG 


12100 
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GGCCTGAGCT GATCTGCTGC CATCTTGCAG TTTTCTGAAC TTCTCTGCTT 12150 
TTTCTCGTGC TATGGCATTG ACTTTTTCTT GCAAGTCTGA GATGTTGCCT 12200 
TCTTTTCGAT AGACTGCAAA TTCAGAACTC TGTAATACAG CTTCTGAACG 12250 
AGTAATCCAA CTGTGAAGTT CAGTTATATC GAJATCCAAC CTTTTCCTGA 12300 
GTTCAGAATC CACAGTTATC TGCCTCTTCT TTTGAGGAGG TGGTGGTGGA 12350 
AGTTCCTCTT GGGCATGTTT TACCATGATT TGTTCCCTTG TGGTCACCAT 12400 
AGTTACCGTT TCCATTACAG TTGTCTGTGT TAGGGATGGT TGAGTGGTGG 12450 
TGACAGCCTG TGAAATTTGT GCTGAACTCT TTTCAAGTTT TTGGGTTAAA 12500 
TTGTCCCAAC GTTGTGCAAA GTTTTCCATC CAGATTTCCA TCTTTTGA6T 12550 
CACTGACTTA TTTTTCAGTG CCGAAAGTAG ATCTTGATTG AGTGAACTTA 12600 
GTTTTTCCAT 6GTTGGCTTT TTCTTTTCTA GATCTATTTT TAAAGTAGAT 12650 
ATTTTGTGAA GACTTGACAT CATTTCATTT TGATCTTTAA AGCCACTTGT 12700 
CTGAATGTTC TTCATTGCAT CTTCTTTTTC TGAAA6CCAT GTACTAAAAA 12750 
GGCACTGTTC TTCAGTAAAA TGCTGCCATT TTAGAAGAAT ATCTTGTAAA 12800 
ACAATCCAGC GGTCTTCAGT CCATCTGCA6 ATATTTGCCC ATCGATCTCC 12850 
CAGTACCTTA AGTTGTTCTT CCAAAGCAGC TGTTGCATGA TCACCGCTGG 12900 
ATTCATCAAC CACTACTACC ATGTGAGTGA GCGAGTTGAC CCTGACCTGC 12950 
TCCTGTTCTA GATCTTCTTG AAGCACCTTA TGTTGTTGTA CTTGGCATTT 13000 
TAGATCTTCA AGATCAGGTC CAAAGGGCTC TTCCTCCATT TTCTTAGTTC 13050 
TCTCTTCAGT TTTTGTTAAC CAGTCATCTA GTTCTTTTAA TTTCTGATTC 13100 
TGGAGATCCA TTAGAACTTT GTGTAATTTG CTrTGTTrTT CCATGCTAGC 13150 
TACCCTGAGA CATTCCCATC TTGAATTTAG GAGATTCATT TGTTCTTGCA 13200 
COTCAGCTTC TTCATCTTCT GATAATTTCC CTTTTCCAA^ "250 
CCTAACTGTA GAACATTACC AACAAGTCCT TGATGAGATG TCAGATCCAT 13300 
CATGAATCCC TCATGAGCAT GAAACTGTTC TTTCACTTCT TCAACATCAT 13350 
TTGAAATCTC TCCPTGTGCT CGCAATGTAT CCTCGGCAGA AAGAAGCCAT 13400 
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GAAAGTACTT CTTCTAAAGC AGTTTGGTAA 
CTCCATCAAT GAACTGTCAA GTGACTTGTC 
6TGAAGGATA 66G6CTCTGT GTG6AATCA6 
TGTGTGAAG6 CATAACTCTT GAATCGAGGC 
TTCATAGCCC TGTGCTAGAC TGACTGTGAT 
GGTGATGTAA TTGAAAATGT TCTTCTCTAG 
GGCAACATTT CCACTTCTTG AATGGCTTCA 
AACTTGAAAG AGTGATGTGA TGTACATTAA 
AAGTGGTAGC AACATCTTCA GGATCAAGAA 
CATTTTGCAA TGTTGAAGGC ATGTTCCAGT 
TGAAACCACA CTATTCCAAT CAAACAGGTC 
GAGCATTCAA AGCCAACCC6 TCGGACCAGC 
TTAACCTGTG GATAATTACG TGTTGACTGT 
CTTTTCACTG TTGGTTTGCT GCAATCCAGC 
TTTTGACCTG CCAGTGGAGG ATTATATTCC 
TTATGATTTC CATCCACTAT GTCAGTGCTT 
ATTATTTTTC TGTAAGACCC 6CAGTGCCTT 
GAACTCTTGT AGATCCCTTT TCTTTTG6CA 
TCCAAGAGGT CTAGGAGGC6 TTTTCCATCC 
GTCTATGTGT TGCTTTCCAA ACTTAGAAAA 
TGAATGTTTT CTTTTGAACA TCTTCTCTTT 
TCCCACCAAA GCATTTGGAA GAAAAAGTAT 
ATCTTGGTAA AAGTTTCTCC CAGTTTTATT 
GATGAGAAGC CAATAAACTT CAGCAGCCTT 
TAGCACTTCA AGTCTTCCTA TTCGTTTTTT 
AGAGCGGAAT TCCTGCAGCC CGGGGGATCC 



CTATCCAGAT 


TTACTTCCGT 


13450 


TCT66GAGCT 


TCCAAATGCT 


13500 


A6GT66CAAC 


ATAAGCA6CC 


13550 


TTlAGGAGATG 


AAGAAGTTT6 


13600 


CT6TTGAGAG 


TAAT6CATCT 


13650 


TTACTTTTGA 


AGATGTCCTG 


13700 


ATGCTCACTT 


GTT6T6GCAA 


13750 


GATGGACTTC 


TT6TCTG6AT 


13800 


GTTTTTCTAT 


GCCTAACT66 


13850 


CTTTGGGT66 


CTGAGTGCT6 


13900 


GGGCCTGTGA 


CTATGGATAA 


13950 


TAGAG6T6AA 


GTT6ATGAC6 


14000 


CGAACCCA6C 


TCA6AA6AAT 


14050 


CATGATA6TT 


TTCATCACAT 


14100 


AAATCAAACC 


AAGAGTGAGT 


14150 


CCTATATTCA 


CTAAATCAAC 


14200 


6TT6ACATT6 


TTCAGGIGCAT 


14250 


6TTTTT6CCC 


TGTAAGGCCT 


14300 


TGCA6GTCAC 


T6AA6AGGTT 


14350 


TTGTGCATTT 


ATCCATTTT6 


14400 


CATAACAGTC 


CTCTACTTCT 


14450 


ATATCAAG6C 


AGGGATAAAA 


14500 


GCTCCA66AG 


GCTTA6GTAC 


14550 


GACAAAAAAA 


AAAAAAAAAA 


14600 


CTATAAAGCT 


ATTGCCTTCA 


14650 


ACTAGTTCTA 


GAGCGGCC6C 


14700 
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GGGTACAATT CCGCAGCTTT TAGAGCAGAA GTAACACTTC CGTACAGGCC 14750 
TAGAAGTAAA GGCAACATCC ACTGAGGAGC AGTrCTTTGA TTTGCACCAC 14800 
CACCGGATCC GGGACCTGAA ATAAAAGACA AAAAGACTAA ACTTACCAGT 14850 
TAACTTTCTG GTTTTTCAGT TCCTCGAGTA CCXGATCCTC TA6AGTCCGG 14900 
AGGCTGGATC GGTCCCGGTG TCTTCTATG6 AGGTCAAAAC AGCGTG6ATG 14950 
GCGTCTCCAG GCGATCTGAC GGTTCACTAA AC6AGCTCT6 CPTATATAGA 15000 
CCTCCCACCG TACACGCCTA CCGCCCATTT GCGTCAATGG GGCGGAGTTG 15050 
TTACGACATT TTGGAAAGTC CCGTTGATTT TGGTGCCAAA ACAAACTCCC 15100 
ATTGACGTCA ATGGGGTGGA GACTTGGAAA TCCCCGT6A6 TCAAACCGCT 15150 
ATCCACGCCC ATTGATGTAC TGCCAAAACC GCATCACCAT GGTAATAGCG 15200 
ATGACTAATA CGTAGATGTA CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT 15250 
GTACTGGGCA TAATGCCAGG CGGGCCATTT ACCGTCATTG ACGTCAATAG 15300 
GGGGCGTACT TGGCATATGA TACACTTGAT GTACTGCCAA GTGGGCAGTT 15350 
TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGG06T 15400 
TACTATGGGA ACATACGTCA TTATTGAC6T CAATGGGCGG GGGTCGTT6C 15450 
GCGGTCAGCC AGGCGGGCCA TTTACCGTAA GTTATGTAAC GACCTGCAGG 15500 
TCGACTCTAG AGGATCTCCC TAGACAAATA ITACGCGCTA T6A6TAACAC 15550 
AAAA-PTATTC AGATTTCACT TCCTCTTATT CAGTTTTCCC GCGAAAATGG 15600 
CCAAATCTTA CTCGGTTACG CCCAAATTTA CTACAACATC CGCCTAAAAC 15650 
CGCGCGAAAA TTGTCACTTC CTGTGTACAC CGGCGCACAC CAAAAACGTC 15700 
ACOTTTGCCA CATCCGTCGC TTACATGTGT TCCGCCACAC TTGCAACATC 15750 
ACACTTCCGC CACACTACTA CGTCACCCGC CCCGTTCCCA CGCCCCGCGC 15800 
CACGTCACAA ACTCCACCCC CTCATTATCA TATTGGCTrC AATCCAAAAT 15850 
AAGGTATATT ATTGATGATG CTAGCGGGGC CCTATATA1X5 GATCCAATTG 15900 
CAATGATCAT CATGACAGAT CTGCGCGCGA TCGATATCAG CGCTTTAAAT 15950 
TTCCGCATGC TAGCTATAGT TCTAGAGGTA CCGGTTGTTA ACGTTAGCCG 16000 
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GCTACGTATA 


CTCCGGAATA 


TTAATAGGCC 


TAGGATGCAT 


ATGGCGGCCG 


16050 


GCCGCCT6CA 


GCTGGC6CCA 


TCGATACGCG 


TACGTCGCGA 


CCGC6GACAT 


16100 


GTACA6A6CT 


C6A6AAGTAC 


TAGT6GCCAC 


6T666CCGTG 


CACCTTAA6C 


16150 


TTG6CACTG6 


CCGTCGTTTT 


ACAACGTCGT 


GAwTGGGAAA 


ACCCTGGC6T 


16200 


TACCCAACTT 


AATC6CCTT6 


CAGCACATCC 


CCCTTTCGCC 


A6CT66C6TA 


16250 


ATA6CGAAGA 


GGCCCGCACC 


GATCGCCCTT 


CCCAACAGTT 


GCGCAGCCTG 


16300 


AATG6CGAAT 


GGCGCCT6AT 


GCGGTATTTT 


CTCCTTACGC 


ATCTGT6C66 


16350 


TATTTCACAC 


CGCATACGTC 


AAAGCAACCA 


TAGTACGCGC 


CCT6TAGC6G 


16400 


CGCATTAAGC 


GCGGCGGGT6 


TGGTGGTTAC 


GCGCA6CGTG 


ACC6CTACAC 


16450 


TTGCCAGCGC 


CCTAGC6CCC 


GCTCCTTTCG 


CTTTCTTCCC 


TTCCTTTCTC 


16500 


GCCACGTTCG 


CCGGCTTTCC 


CCGTCAAGCT 


CTAAATCGGG 


GGCTCCCTTT 


16550 


AGGGTTCC6A 


TTTAGTGCTT 


TACGGCACCT 


CGACCCCAAA 


AAACTTGATT 


16600 


TGG6T6AT66 


TTCAC6TAGT 


GG6CCATC6C 


CCTGATAGAC 


G6TTTTTC6C 


16650 


CCTTTGA06T 


TGGAGTCCAC 


GTTCTTTAAT 


AGTGGACTCT 


T6TTCCAAAC 


16700 


TGGAACAACA 


CTCAACCCTA 


TCTCGGGCTA 


TTCTTTTGAT 


TTATAA6GGA 


16750 


TTTTGCCGAT 


TTCGGCCTAT 


TGGTTAAAAA 


AT6AGCTGAT 


TTAACAAAAA 


16800 


TTTAACGC6A 


ATTTTAACAA 


AATATTAAC6 


TTTACAATTT 


TATGGT6CAC 


16850 


TCTCAGTACA 


ATCT6CTCT6 


ATGCCGCATA 


GTTAAGCCA6 


CCCCGACACC 


16900 


CGCCAACACC 


CGCTGACGCG 


CCCT6AC6GG 


CTTGTCTGCT 


CCCGGCATCC 


16950 


GCTTACAGAC 


AAGCTGTGAC 


CGTCTCCGGG 


A6CTGCATGT 


GTCAGAGGTT 


17000 


TTCACCGTCA 


TCACCGAAAC 


GCGCGAGACG 


AAAG6GCCTC 


GTGATACGCC 


17050 


TATTTTTATA 


6GTTAAT6TC 


ATGATAATAA 


TGGTTTCTTA 


GAC6TCAGGT 


17100 


GGCACTTTTC 


GGGGAAAT6T 


GC6CGGAACC 


CCTATTT6TT 


TATTTTTCTA 


17150 


AATACATTCA 


AATATGTATC 


CGCTCAT6A6 


ACAATAACCC 


TGATAAAT6C 


17200 


TTCAATAATA 


TTGAAAAAGG 


AAGAGTATGA 


GTATTCAACA 


TTTCCGTGTC 


17250 


GCCCTTATTC 


CCTTTTTTGC 


GGCATTTTGC 


CTTCCTGTTT 


TTGCTCACCC 


17300 
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AGAAACGCTG GTGAAAGTAA AAGATGCTGA AGATCA6TT6 GGT6CACGAG 17350 
TGG6TTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT TGAGAGTTTT 17400 
CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG 17450 
TGGCGCGGTA TTATCCCGTA TTGACGCCG6 GCAAGAGCAA CTCGGTCGCC 17500 
GCATACACTA TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA 17550 
AAGCATCTTA CGGATGGCAT GACAGTAAGA GAATTATGCA GT6CTGCCAT 17600 
AACCATGA6T GATAACACTG CGGCCAACTT ACTTCTGACA ACGATCGGAG 17650 
GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA TCATGTAACT 17700 
CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA 17750 
GCGTGACACC ACGATGCCTG TAGCAATGGC AACAACGTT6 CGCAAACTAT 17800 
TAACTGGCGA ACTACTTACT CTAGCTTCCC 6GCAACAATT AATAGACTGG 17850 
ATGGAGGCGG ATAAAGTTGC AGGACCACTT CTGCGCTCGG CCCTTCCGGC 17900 
TGGCTGGTTT ATTGCTGATA AATCTGGAGC CGGTGAGCGT GGGTCTCGCG 17950 
GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG TATCGTAGTT 18000 
ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACA6AT 18050 
CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACT6 TCAGACCAAG 18100 
TTTACTCATA TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA 18150 
AGGATCTAGG TGAAGATCCT TTTTGATAAT CTCATGACCA AAATCCCTTA 18200 
ACGTGAGTTT TCGTTCCACT GAGCGTCAGA CCCCGTAGAA AAGATCAAAG 18250 
GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG CTTGCAAACA 18300 
AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC 18350 
AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA 18400 
CTGTOCTTCT AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA 18450 
GCACCGCCTA CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC 18500 
CAGTGGCGAT AAGTCGTGTC TOACCGGGTr GGACTCAAGA CGATAGTTAC 18550 
CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG CACACAGCCC 18600 
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AGCTTGGAGC GAACGACCT7. CACCGAACT6 
ATGAGAAAGC GCCACGCTTC CCGAAGGGAG 
TAAGCGGCA6 G6TCGGAACA GGAGAGC6CA 
AAC6CCTGGT ATCTTTATA6 TCCTGTCX»3G 
GCGTCGATTT TTGTGATGCT CGTCAG6G6G 
CCAGCAACGC GGCCTTTTTA CGGTTCCTG6 
CACATGTTCT TTCCTGCGTT ATCCCCTGAT 
C6CCTTTGA6 TGAGCTGATA CCGCTC6CCG 
GCGAGTCAGT GAGC6AGGAA GCGGAA6A6C 
CTCCCCGCGC GTTGGCCGAT TCATTAATGC 
CGACTGGAAA GCGGGCAGTG AGCGCAACGC 
CTCATTAGGC ACCCCA6GCT TTACACTTTA 
TGTGGAATT6 TGAGCG6ATA ACAATTTCAC 
TGATTACGAA TTCGAATGGC CATGG6AC6T 
CCGGGCC 



AGATACCTAC 


AGCGTGA6CT 


18650 


AAAGGCGGAC 


AGGTATCCGG 


18700 


CGAGGGAGCT 


TCCAGGGGGA 


18750 


TT*'CGCCACC 


TCTGACTT6A 


18800 


GCGGAGCCTA 


TGGAAAAAC6 


18850 


CCTTTTGCTG 


GCCTTTTGCT 


18900 


TCTGTGGATA 


ACCGTATTAC 


18950 


CAGCC6AACG 


ACCGAGC6GA 


19000 


GCCCAATAC6 


CAAACCGCCT 


19050 


AGCTGGCACG 


ACAGGTTTCC 


19100 


AATTAATGTG 


AGTTAGCTCA 


19150 


TGCTTCCGGC 


TCGTATGTTG 


19200 


ACAGGAAACA 


GCTATGACCA 


19250 


CGACCTGAGG 


TAATTATAAC 


19300 
19307 
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WHAT IS CLAIMED IS: 

1. A recombinant shuttle vector comprising: 

(a) the DNA sequences of, or corresponding to, 
a portion of the genome of an auenovirus which comprises 
DNA sequences of, or corresponding to, the adenovirus 5' 
and 3* inverted terminal repeats and packaging/enhancer 
domain necessary for replication and virion encapsidation 
in the absence of sequence encoding viral genes; 

(b) a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
operatively linked to the DNA of (a) and capable of 
expression in a target cell in vivo or in vitro. 

2. The vector according to claim 1 wherein said 
DNA sequences (a) comprise the native adenovirus 5' 
inverted terminal repeats and packaging sequences. 

3. The vector according to claim 1 wherein said 
DNA sequences (a) comprise the native adenovirus 3' 
inverted terminal repeat sequences. 

4. The vector according to claim 1 wherein said 
selected gene (b) is a reporter gene. 

5. The vector according to claim 4 wherein said 
reporter gene is selected from the group consisting of 
the genes encoding B-galactosidase, alkaline phosphatase 
and green fluorescent protein. 

6. The vector according to claim 1 wherein said 
selected gene (b) is a therapeutic gene. 
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7* The vector according to claim 6 wherein said 
therapeutic gene is selected from the group consisting of 
a normal CFTR gene, a DMD Becker allele and a normal LDL 
gene. 

8. A crippled adenovirus helper virus comprising a 
modified adenovirus sequence in place of native 
adenovirus sequence map units 0-1, which modification 
reduces the packaging efficiency of said virus, said 
virus also containing selected adenovirus genes necessary 
to direct a productive viral infection. 

9. The helpier virus according to claim 8 wherein 
said modified sequence comprises: 

i. a fragment of adenovirus map units 0-1; 

11. a fragment of (i) containing a 5* inverted 
terminal repeat and between one to four selected 
packaging sequences, 

ill. a modified fragment of (i) containing at 
least one PAC consensus sequence in place of a^ least one 
native PAC sequence; and 

iv. a modified fragment of (11), wherein said 
native PAC sequences are mutated to contain modified 
sequences. 

10. The virus according to claim 8 wherein said 
modified sequence comprises Ad5 base pairs l-*2€9. 

11. The virus according to claim 8 wherein said 
sequence (11) comprises Ad5 base pairs 1-321. 

12. The virus according to claim 8 wherein said 
helper adenovirus is conjugated to a poly-cation 
secpience. 
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13. A method for producing a recombinant adenovirus 
which comprises transfecting a selected host cell with 

(a) a recombinant shuttle vector comprising 

i. the DNA sequences of, or 
corresponding to, a portion of genome of an 
adenovirus which comprises adenovirus 5* and 3' cis- 
elements necessary for replication and virion 
encapsidation in the absence of sequence encoding viral 
genes; and 

ii. a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
linked to the DNA of (a) and capable of expression in a 
target cell in vivo or in vitro; and 

(b) a helper adenovirus comprising sufficient 
adenovirus gene sequences necessary for a productive 
viral infection, wherein said transfected host cell 
permits the formation of a recombinant virus comprising 
the DNA of (i) and (ii) in an adenoviral capsid, and 

isolating and purifying the recombinant virus from 
said cell. 

14. The method according to claim 13, wherein said 
helper virus is a crippled helper virus comprising a 
modified adenovirus sequence in place of native 
adenovirus sequence map units 0-1, which modification 
reduces the packaging efficiency of said helper virus, 
said helper virus also containing selected adenovirus 
genes necessary to direct a productive viral infection. 

15. The method according to claim 13 wherein said 
helper adenovirus is associated with a poly-cation 
sequence. 
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16. The method according to claim 13 wherein said 
vector is associated with said helper adenovirus 
conjugate in a single particle. 

17. The method according co claim 13 wherein said 
helper virus is an adenovirus sequence containing 
deletions of all or portions of the Ela and Elb genes. 

18. The method according to claim 13 wherein said 
helper virus is an adenovirus sequence containing 
deletions of all or a portion of the E3 gene. 

19. A recoml)inant adenovirus comprising 

i. the DNA of, or corresponding to, a 
portion of the genome of an adenovirus which comprises 
adenovirus 5* and 3 "cis-elements necessary for 
replication and virion encapsidation in the absence of 
sequence encoding viral genes; 

ii. a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
linked to the DNA of (a) and capable of expression in a 
target cell in vivo or in vitro; 

said DNA and gene encapsidated in an adenoviral 
caps id. 

20. The virus according to claim 19 wherein said 
viral capsid is a capsid of an adenovirus serotype 
selected from the group consisting of types 2, 4, 5, 7, 
12 and 40. 

21. The virus according to claim 19 wherein said 
selected gene is a CFTR gene, a DMD gene and an LDL gene. 
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22. The use Of a recombinant adenovirus according 
to claim 19 for the manufacture of a pharmaceutical 
composition suitable for delivering and integrating a 
selected gene into the chromosome of a target cell. 
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FI61JRE 3A 



GAACTCGAGC 


AGCTGAAGCT 


TGAATTCCAT CATCAATAAT 


ATACCTTATT 


50 


TTGGATTGAA 


GCCAATATGA 


TAATGAG6G6 


GTGGAGTTTG 


TGACGTGG06 


100 


CGGGGCGTG6 


GAACGGG6CG 


GGTGACGTA6 


GTTTTAGGGC 


GGAGTAACTT 


150 


GTATGTGTTG 


GGAATTGTAG 


TTTTCTTAAA 


ATGGGAAGTT 


ACGTAACGTG 


200 


GGAAAACGGA 


AGTGACGATT 


TGAGGAAGTT 


GTGGGTTTTT 


TGGCTTTCGT 


250 


TTCTGGGCGT 


AGGTTCGCGT 


GCGGTTTTCT 


GGGTGTTTTT 


TGTGGACTTT 


300 


AACCGTTAC6 


TCATTTTTTA 


GTCCTATATA 


TACTCGCTCT 


GCACTTGGCC 


350 


CTTTrn'ACA 


CTGTGACTGA 


TTGA6CTGGT 


GCCGTGTCGA 


GTGGTGTTTT 


400 


TTTAATAGGT TTTCTTTTTT ACTGGTAAGG 


CTGACTGTTA 


GCCTGCCGCT 


450 


GTGAAGCGCT 


GTATGTTGTT 


CTGGAGCGGG 


AGGGTGCTAT 




500 


GAGGAG6GTT 


TTTCAGGTGT 


TTATGTGTTT 


TTCTCTCCTA 




550 


TATACCTCCT 


ATGG6GGCT6 


TAATGTTGTC 


TCTACGCCTG 


CGGGTATGTA 


600 


TTCCCCCCyiA 


GCTTGCATGC 


CTGCAGGTCG 


ACTCTAGAGG 


ATCCGAAAAA 


650 


ACCTCCCACA 


CCTCCCCCTG 


AACCTGAAAC 


ATAAAATGAA 


TGCAATTGTT 


700 


GTTGTTAACT 


TGTTTATTGC 


AGCTTATAAT 


GGTTACAAAT 


AAAGCAATA6 


750 


CATCACAAAT 


TTCACAAATA AAGCATTTTT 


TTCACTGCAT 


TCTAGTTGTG 


800 


GTTTGTCCAA 


ACTCATCAAT 


GTATCTTATC 


ATGTCTGGAT 


CCCCGCGGCC 


850 


GCCTAGAGTC 


GAGGCCGAGT 


TTGTCAGAAA 


GCAGACCAAA 


CAGCGGTTGG 


900 


AATAATAGCG 


AGAACAGAGA 


AATAGCGGCA 


AAAATAATAC 


CCGTATCACT 


950 


TTTGCTGATA 


TGGTTGATGT 


CATGTAGCCA 


AATCGGGAAA 


AACGGGAAGT 


1000 


A6GCTCCCAT GATAAAAAAG TAAAAGAAAA AGAATAAACC GAACATCCAA 


1050 


AAGTTTGTGT 


TTTTTAAATA 


GTACATAATG 


GATTTCCTTA 


CGCGAAATAC 


lioo 


GGGCAGACAT 


GGCCTGCCCG 


GTTATTATTA 


TTTTTGACAC 


CAGACCAACT 


1150 


GGTAATGGTA 


GCGACCGGCG 


CTCAGCTGTA 


ATTCCGCCGA 


TACTGACGGG 


1200 


CTCCAGGAGT 


CGTCGCCACC 


AATCCCCATA 


TGGAAACCGT 


CGATATTCAG 


1250 


CCATGTGCCT 


TCTTCCGCGT 


GCAGCAGATG 


GCGATGGCTG 


CTTTCCATCA 


1300 


GTTGCTGTTG 


ACTGTAGCGG 


CTGATGTTGA 


ACTGGAAGTC 


GCCGCGCCAC 


1350 
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FIGURE 3B 

TGGTGTGGGC CATAATTCAA TTCGCGCGTC CCGCAGCGCA GACCGTTTTC 1400 

GCTCGGGAAG ACGTAC6GG6 TATACATGTC TGACAATGCC AGATCCCAGC 1450 

GGTCAAAACA GGCGGCAGTA AGGCGGTCGG GATAGTTTTC TTGC66CCCT 1500 

AATCCGAGCC AGTTTACCCG CTCTGCTACC TGCGCCAGCT GGCAGTTCAG 1550 

GCCAATCCGC GCCGGATGCG GTGTATCGCT CGCCACTTCA ACATCAACGG 1600 

TAATC6CCAT TTGACCACTA CCATCAATCC GGTAGGTTTT CCGGCTGATA 1650 

AATAAGGTTT TCCCCTGATG CTGCCACGCG TGAGCGGTCG TAATCaiGCAC 1700 

CGCATCAGCA AGTGTATCTG CCGTGCACTG CAACAACGCT GCTTCGGCCT 1750 

GGTAATGGCC CGCCGCCTTC CAGCGTTCGA CCCAGGCGTT AGGGTCAATG 1800 

CGGCTCGCTT CACTTACCCC AATGTCGTTA TCCAGCGGTG CACGGGTGAA 1850 

CTGATCGC6C AGCGGCGTCA 6CAGTT6TTT TTTATCGCCA ATCCACATCT 1900 

GTGAAAGAAA GCCTGACTGG CGGTTAAATT GCCAAC6CTT ATTACCCAGC 1950 

TCGATGCAAA AATCCATTTC GCTGGTGGTC AGATGCGGGA T6GCGTGGGA 2000 

CGCGGCGGGG AGCGTCACAC TGAGGTTTTC CGCCAGACGC CACTGCTGCC 2050 

AGGCGCTGAT GTGCCCGGCT TCTGACCATG CGGTCGCGTT CGGTTGCACT 2100 

ACGCGTACTG TGAGCCAGAG TTGCCCGGCG CTCTCCGGCT GCGGTAGTTC 2150 

AGGCAGTTCA ATCAACTGTT TACCTTGTGG AGCGACATCC AGAGGCACTT 2200 

CACCGCTTGC CAGCGGCTTA CCATCCAGCX5 CCACCATCCA GTGCAGGAGC 2250 

TCGTTATCGC TATGACGGAA CAGGTATTCG CTGGTCACTT CGATGGTTT6 2300 

CCCGGATAAA CGGAACTGGA AAAACTGCTG CTGGTGTTTT GCTTCCGTCA 2350 

GCGCTGGATG CGGCGTGCGG TCGGCAAAGA CCAGACCGTT CATACAGAAC 2400 

TGGCGATCGT TCGGCGTATC GCCAAAATCA CCGCCGTAAG CCGACCACGG 2450 

GTTGCCGTTT TCATCATATT TAATCAGCGA CTGATCCACC CAGTCCCA6A 2500 

CGAAGCCGCC CTGTAAACGG GGATACTGAC GAAACGCCTG CCAGTATTTA 2550 

GCGAAACCGC CAAGACTGTT ACCCATCGCG TGGGCGTATT CGCAAAGGAT 2600 

CAGCGGGCGC GTCTCTCCA6 GTAGCGAAAG CCATTTTTTG ATGGACCATT 2650 
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TCGGCACAGC 


CGGGAAGGGC 


TGGTCTTCAT 


CCACGCGCGC 


GTACATCGG6 


2700 


CAAATAATAT 


CGGT6GCCGT 


GGTGTCGGCT 


CCGCC6CCTT 


CATACTGCAC 


2750 


CGGGCGGGAA 


GGATCGACAG 


ATTTGATCCA 


GCGATACAGC 


GCGTCGT6AT 


2800 


TAGCGCCGTG 


GCCTGATTCA 


TTCCCCAGCG 


ACCAGATGAT 


CACACTCGG6 


2850 


TGATTACGAT 


CGCGCTGCAC 


CATTCGCGTT 


ACGCGTTCGC 


TCATCGCCG6 


2900 


TAGCCAGCGC 


GGATCATCGG 


TCAGACGATT 


CATTGGCACC 


ATGCCGT66G 


2950 


TTTCAATATT GGCTTCATCC ACCACATACA G6CCGTAGCG 


GTCGCACA6C 


3000 


GTGTACCACA 


GCGGATGGTT 


CGGATAATGC 


GAACAGCGCA 


CGGCGTTAAA 


3050 


GTTGTTCTGC 


TTCATCAGCA 


GGATATCCTG 


CACCATCGTC 


TGCTCATCCA 


3100 


TGACCTGACC ATGCAGAGGA T6ATGCTCGT 


GACGGTTAAC 


GCCTCGAATC 


3150 


A6CAACGGCT 


TGCCGTTCAG 


CAGCAGCA6A 


CCATTTTCAA 


TCCGCACCTC 


3200 


GCGGAAACC6 


ACATCGCA66 


CTTCTGCTTC 


AATCAGCGTG 


CCGTCGGCGG 


3250 


TGTGCAGTTC 


AACCACCGCA 


CGATAGAGAT 


TCGGGATTTC 


GGCGCTCCAC 


3300 


AGTTTCGGGT 


TTTCGACGTT 


CAGACGTAGT 


GTGACGCGAT 


CGGCATAACC 


3350 


ACCACGCTCA 


TCGATAATTT 


CACCGCCGAA AGGCGCGGTG 


CCGCTGGCGA 


3400 


CCTGCGTTTC 


ACCCTGCCAT AAAGAAACTG 


TTACCCGTA6 


GTAGTCAC6C 


3450 


AACTCGCCGC ACATCTGAAC TTCAGCCTCC AGTACAGCGC GGCTGAAATC 


3500 


ATCATTAAAG 


CGAGTGGCAA 


CATGGAAATC 


GCTGATTTGT 


GTAGTCGGTT 


3550 


TATGCAGCAA 


CGAGACGTCA 


CGGAAAATGC 


CGCTCATCCG 


CCACATATCC 




TGATCTTCCA 


GATAACTGCC 


GTCACTCCAA 


CGCAGCACCA 


TCACCGCGA6 


3650 


GCGGTTTTCT 


CCGGCGCGTA 


AAAATGCGCT 


CAGGTCAAAT 


TCAGACGGCA 


3700 


AACGACTGTC 


CTGGCCGTAA 


CCGACCCAGC 


GCCCGTTGCA 


CCACAGATGA 


3750 


AACGCCGAGT 


TAACGCCATC 


AAAAATAATT 


CGCGTCTGGC 


CTTCCTGTAG 


3800 


CCAGCTTTCA 


TCAACATTAA 


ATGTGAGCGA 


GTAACAACCC 


GTCGGATTCT 


3850 


CCGTGGGAAC AAACGGCGGA TTGACCGTAA TGGGATAGGT TACGTTGGTG 


3900 


TAGATGGGCG 


CATCGTAACC 


GTGCATCTGC 


CAGTTTGAGG 


GGACGACGAC 


3950 
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FIGURE 3D 

AGTATCGGCC TCAGGAAGAT CGCACTCCAG CCAGCTTTCC GGCACCGCTT 4000 

CTGGTGCCGG AAACCAGGCA AAGCGCCATT CGCCATTCAG GCTGCGCAAC 4050 

TGTTGGGAAG GGCGATCGGT GCGGGCCTCT TC-.CTATTAC GCCAGCTG6C 4100 

CAAAGGGGGA TGTGCTGCAA GGCGATTAA6 TTGGGTAACG CCAGGGTTTT 4150 

CCCAGTCACG ACGTTGTAAA ACGACGGGAT CGCGCTTGAG CAGCTCCTTG 4200 

CTGGTGTCCA GACCAATGCC TCCCAGACCG GCAACGAAAA TCACGTTCTT 4250 

GTTGGTCAAA GTAAACGACA TGGTGACTTC TTTTTTGCTT TAGCAGGCTC 4300 

TTTCGATCCC CGGGAATTGC GGCCGC6GGT ACAATTCCGC AGCTTTTAGA 4350 

GCAGAAGTAA CACTTCCGTA CAGGCCTAGA AGTAAAGGCA ACATCCACTG 4400 

AGGAGCAGTT CTTTGATTTG CACCACCACC GGATCCGGGA CCTGAAATAA 4450 

AAGACAAAAA GACTAAACTT ACCAGTTAAC TTTCTGGTTT TTCAGTTCCT 4500 

C6AGTACCG6 ATCCTCTAGA GTCCGGAGGC TGGATCGGTC CCGGTCTCTT 4550 

CTATGGAGGT CAAAACAGCX5 TGGATGGCGT CTCCAGGOGA TCTGACGGTT 4600 

CACTAAACGA GCTCTGCTTA TATAGACCTC CCACCGTACA CGCCTACC6C 4650 

CCATTTGCGT CAATGGGGC6 GAGTTGTTAC GACATTTTGG AAAGTCCCGT 4700 

TGATTTTGGT GCCAAAACAA ACTCCCATT6 ACGTCAATGG GGTGGAGACT 4750 

T6GAAATCCC CGTGAGTCAA ACCGCTATCC ACGCCCATTG ATGTACTGCC 4800 

AAAACCGCAT CACCAT66TA ATAGC6ATGA CTAATACGTA GATGTACTGC 4850 

CAAGTAGGAA AGTCCCATAA GGTCATGTAC TGGGCATAAT 6CCAGGCG66 4900 

CCATTTACCG TCATTGACGT CAATAGGGGG CGTACTTGGC ATATGATACA 4950 

CTTGATGTAC TGCCAAGTGG GCAGTTTACC GTAAATACTC CACCCATT6A 5000 

CGTCAATGGA AAGTCCCTAT TGGCGTTACT ATGGGAACAT ACGTCATTAT 5050 

TGACGTCAAT GGGCGGGGGT CGTTGGGCGG TCAGCCAGGC GGGCCATTTA 5100 

CCGTAAGTTA TGTAACGACC TGCAGGTCGA CTCTA6A6GA TCTCCCTAGA 5150 

CAAATATTAC GCGCTATGA6 TAACACAAAA TTATTCAGAT TTCACTTCCSr 5200 

CTTATTCAGT TTTCCCGCGA AAATGGCCAA ATCTTACTCG GTTACGCCCA 5250 
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FIGURE 3E 



AATTTACTAC 


AACATCCGCC 


TAAAACCGCG 


CGAAAATTGT 


CACTTCCTGT 


5300 


GTACACCGGC 


GCACACCAAA 


AACGTCACTT 


TTGCCACATC 


CGTCGCTTAC 


5350 


ATGTGTTCCG 


CCACACTTGC 


AACATCACAC 


TTCCGCCACA 


CTACTAC6TC 


5400 


ACCCGCCCCG 


TTCCCACGCC 


CCGCGCCACG 


TCACAAACTC 


CACCCCCTCA 


5450 


TTATCATATT 


GGCTTCAATC 


CAAAATAAGG 


TATATTATTG 


ATGATGCTAG 


5500 


CGAATTCATC 


GATGATATCA 


GATCTGCCGG 


TCTCCCTATA 


GTGAGTCGTA 


5550 


TTAATTTCGA 


TAAGCCAGGT 


TAACCTGCAT 


TAATGAATCG 


GCCAACGC6C 


5600 


GGGGAGAGGC 


GGTTTGCGTA 


TTGGGCGCTC 


TTCCGCTTCC 


TCGCTCACT6 


5650 


ACTCGCTGCG 


CTCGGTCGTT 


CGGCTGCGGC 


GAGCGGTATC 


AGCTCACTCA 


5700 


AAGGCGGTAA 


TACGGTTATC 


CACAGAATCA 


GGGGATAACG 


CAGGAAAGAA 


5750 


CATGTGAGCA 


AAAGGCCAGC 


AAAAGGCCAG 


GAACCGTAAA 


AAGGCCGCGT 


5800 


TGCTGGCGTT 


TTTCCATAGG 


CTCCGCCCCC 


CTGACGAGCA 


TCACAAAAAT 


5850 


CGACGCTCAA 


GTCAGAGGTG 


GCGAAACCCG 


ACAGGACTAT 


AAAGATACCA 


5900 


GGCGTTTCCC 


CCTGGAAGCT 


CCCTCGTGCG 


CTCTCCTGTT 


CCGACCCT6C 


5950 


CGCTTACCGG 


ATACCTGTCC 


GCCTTTCTCC 


CTTCGGGAAG 


CGTGGCGCTT 


6000 


TCTCAATGCT 


CACGCTGTAG 


GTATCTCAGT 


TCGGTGTAGG 


TCGTTCGCTC 


6050 


CAAGCTGGGC 


TGTGTGCACG 


AACCCCCCGT 


TCAGCCCGAC 


CGCT6CGCCT 


6100 


TATCCGGTAA 


CTATCGTCTT 


GAGTCCAACC 


CGGTAAGACA 


CGACTTATC6 


6150 






TAACAGGATT 

A«W»w«»WW#%A A 


AGCAGAGCGA 


GGTAT6TAGG 


6200 


CGGTGCTACA 


GAGTTCTTGA 


AGTGGTGGCC 


TAACTACGGC 


TACACTAGAA 


6250 


GGACAGTATT 


TGGTATCTGC 


GCTCTGCTGA 


AiSCCAGTTAC 


CTTCGGAAAA 


6300 


AGAGTTGGTA 


GCTCTTGATC 


CGGCAAACAA 


ACCACC6CTG 


CTAGCGGTGG 


6350 


TTTTTTTGTT 


TGCAAGCAGC 


AGATTACGCG 


CAGAAAAAAA 


GGATCTCAAG 


6400 


AAGATCCTTT 


GATCTTTTCT 


ACGGGGTCTG 


ACGCTCAGTG 


GAACGAAAAC 


6450 


TCACGTTAAG 


GGATTTTGGT 


CATGAGATTA 


TCAAAAA6GA 


TCTTCACCTA 


6500 


GATCCTTTTA 


AATTAAAAAT 


GAAGTTTTAA 


ATCAATCTAA 


A6TATATAT6 


6550 
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FIGURE 3F 

AGTAAACTTG GTCTGACAGT TACCAXT6CT TAATCAGTGA GGCACCTATC 6600 
TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT 6650 
GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 6700 
TGATACCGCG AGACCCACXSC TCACCGGCTC O'SATTTATC AGCAATAAAC 6750 
CAGCCAGCCG GAAGGGCCGA GCGCA6AAGT GGTCCTGCAA CTTTATCCOC 6800 
CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC 6850 
CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 6900 
TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC 6950 
AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7000 
TCGGTCCTCC 6ATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC 7050 
ATGGTTATGG CAGCACTGCA TAATTCTCTT ACT6TCATGC CATCCGTAAC 7100 
ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT 7150 
GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATAC6 GGATAATACC 7200 
GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC 7250 
GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGAT6T 7300 
AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCA6C 7350 
GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT 7400 
AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT 
ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 
TGTATTTA6A AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA 
AGTGCCACCT GACGTCTAAG AAACCATTAT TATCAT6ACA TTAACCTATA 7600 
AAAATAGGCG TATCACGAGG CCCTTTCGTC TCGCGCGTTT CGGTGATGAC 
GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA CAGCTTGTCT 
GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 
TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA 
CTGAGAGTGC ACCATATGGA CATATTGTCG TTAGAACGCG GCTACAATTA 
ATACATAACC TTATGTATCA TACACATACG ATTTAGGTGA CACTATA 
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FIGURE 5A 

6AATTCCCTA GCTAGCGGGG GAATACATAC CC6CA6GCGT A6A6ACAACA 50 

TTACAGCCCC CATAGGAGGT ATAACAAAAT TAATAGGAGA GAAAAACACA 100 

TAAACACCTG AAAAACCCTC CTGCCTAGGC AAAATAGCAC CCTCCCGCTC 150 

CAGAACAACA TACAGCGCTT CACAGCGGCA GCCTAACAGT CAGCCTTACC 200 

AGTAAAAAA6 AAAACCTATT AAAAAAACAC CACTCGACAC GGCACCAGCT 250 

CAATCAGTCA CAGTGTAAAA AAGG6CCAA6 T6CAGAGCGA GTATATATAG 300 
GACTAAAAAA TGACGTAACG GTTAAAGTCC ACAAAAAACA CCCAGAAAAC 350 
CGCACGCGAA CCTACGCCCA GAAACGAAAG CCAAAAAACC CACAACTTCC 400 
TCAAATCGTC ACTTCCGTTT TCCCACGTTA CGTAACTTCC CATTTTAAGA 450 
AAACTACAAT TCCCAACACA TACAAGTTAC TCCGCCCTAA AACCTACGTC 500 
ACCC6CCCC6 TTCCCACGCC CCGC6CCAC6 TCACAAACTC CACCCCCTCA 550 
TTATCATATT GGCTTCAATC CAAAATAAG6 TATATTATTG ATGATGCTAG 600 
CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 650 
GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 700 
TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 750 
GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 800 
GAAGTGACAA TTTTCGCGCG GTTTTAGGC6 GATGTTGTA6 TAAATTTGG6 850 
CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAA6AGGA 900 
AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 950 

GGGAGATCAG CCTGCAGGTC GTTACATAAC TTACGGTAAA TGGCCCGCCT 1000 

GGCTGACCGC CCAACGACCC CCGCCCATT6 ACGTCAATAA TGACGTATGT 1050 

TCCCATAGTA ACGCCAATAO G6ACTTTCCA TTGACGTCAA TGG6TGGAGT 1100 

ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 1150 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA 1200 

TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG 1250 

TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT 1300 
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FIGURE SB 



GGGCGTGGAT AGCGGTTTGA 


CTCACGGGGA 


TTTCCAAGTC 


TCCACCCCAT 


1350 


TGACGTCAAT 


6GGAGTTTGT 


TTTGGCACCA 


AAATCAACG6 


GACTTTCCAA 


1400 


AATGTCGTAA 


CAACTCCGCC 


CCATTGACGC 


AAATGGGCGG 


TAGGCGTGTA 


1450 


CGGTGGGAGG 


TCTATATAAG 


CAGAGCTCGT 


TTAGTGAACC 


GTCAGATCGC 


1500 


CTGGAGACGC 


CATCCACGCT 


GTTTTGACCT 


CCATAGAAGA 


CACCGGGACC 


1550 


GATCCAGCCT 


CCGGACTCTA 


GAGGATCCGG 


TACTCGAGGA 


ACTGAAAAAC 


1600 


CAGAAAGTTA 


ACTGGTAAGT 


TTAGTCTTTT 


TGTCTTTTAT 


TTCAGGTCCC 


1650 


GGATCCGGTG 


GTGGTGCAAA 


TCAAAGAACT 


GCTCCTCAGT 


6GATGTTGCC 


1700 


TTTACTTCTA 


GGCCTGTAC6 


GAAGTGTTAC 


TTCTGCTCTA 


AAA6CT6CG6 


1750 


AATTGTACCC 


GCGGCCGCAA 


TTCCCGGGGA 


TCGAAAGAGC 


CTGCTAAAGC 


1800 


AAAAAAGAAG 


TCACCATGTC 


GTTTACTTTC 


ACCAACAAGA 


ACGTGATTTT 


1850 


CGTTGCCGGT 


CTGGGAGGCA 


TTGGTCTGGA 


CACCAGCAAG 


GAGCT6CTGA 


1900 


AGCGCGATCC 


CGTCGTTTTA 


CAACGTCGTG 


ACTGGGAAAA 


CCCTGGCGTT 


1950 


ACCCAACTTA 


ATCGCCTTGC 


AGCACATCCC 


CCTTTCGCCA 


GCTGGCGTAA 


2000 


TAGCGAAGAG 


GCCCGCACCG 


ATCGCCCTTC 


CCAACAGTTG 


CGCAGCCTGA 


2050 


ATGGCGAAT6 


GCGCTTTGCC 


T6GTTTCCGG 


CACCAGAAGC 


GGTGCCGGAA 


2100 


AGCTGGCTGG 


AGTGCGATCT 


TCCTGAGGCC 


GATACTGTCG 


TCGTCCCCTC 


2150 


AAACTGGCA6 


ATGCACG6TT 


ACGATGCGCC 


CATCTACACC 


AACGTAACCT 


2200 


ATCCCATTAC 


GGTCAATCCG 




CCACGGAGAA 


TCCGACGG6T 


2250 


TGTTACTCGC 


TCACATTTAA 


TGTTGATGAA 


AGCTGGCTAC 


AGGAAGGCCA 


2300 


GACGCGAATT 


ATTTTTGATG 


GCGTTAACTC 


GGCGTTTCAT 


CTCTGGTGCA 


2350 


ACGGGCGCTG 


GGtCGGTTAC 


GGCCAGGACA 


GTCGTTT6CC 


GTCTGAATTT 


2400 


GACCTGAGCG 


CATTTTTACG 


CGCCGGAGAA 


AACCGCCTCG 


C6GTGAT6GT 


2450 


GCTGCGTTGG 


AGTGACGGCA 


GTTATCTGGA 


AGATCAGGAT 


ATGTGGCGGA 


2500 


TGAGCGGCAT 


TTTCCGTGAC 


GTCTCGTTGC 


TGCATAAACC 


GACTACACAA 


2550 


ATCAGCGATT TCCATGTTGC 


CACTCGCTTT 


AATGATGATT 


TCAGCCGCGC 


2600 



sttsnmiESiiEEroniEit) 



wo 96/13597 



PCTAJS95/14017 



12/45 
FIGURE 5C 

TGTACTGGAG GCTGAAGTTC AGATGTGCGG C6AGTTGCGT GACTACCTAC 
GGGTAACAGT TTCTTTATGG CAG6GT6AAA CGCAGGTCGC CAGCGGCACC 
GCGCCTTTCG GCGGTGAAAT TATCGATCAG CGTGGTGGTT ATGCCGATC6 
CGTCACACTA CGTCTGAACG TCGAAAACCC 6AAACT6T6G AGCGCC6AAA 
TCCCGAATCT CTATCGTGCG GTGGTTGAAC TGCACACCGC CGACGGCAC6 
CT6ATTGAAG CAGAAGCCTG CGATGTCGGT TTCCGCGAGG TGCGGATTGA 
AAATGGTCTG CTGCTGCTGA ACGGCAAGCC GTTGCTGATT CGAGGCGTTA 
ACCGTCACGA GCATCATCCT CTGCATGGTC AGGTCATGGA TGAGCAGACC 
ATGGTGCAGG ATATCCTGCT GATGAAGCAG AACAACTTTA ACGCCGTGC6 
CTGTTCGCAT TATCCGAACC ATCCGCTGTG GTACACGCTG TGCGACCGCT 
ACGGCCTGTA TGTGGTGGAT GAAGCCAATA TTGAAACCCA CGGCATGGTG 
CCAATGAATC GTCTGACC6A TGATCCGCGC TGGCTACCGG CGATGAGCGA 
ACGCGTAACG CGAATGGTGC AGCGCGATCG TAATCACCC6 AGT6TGATCA 
TCTGCTCGCT G6GGAATGAA TCAGGCCAC6 6C6CTAATCA CGAC6CWCTC 
TATCGCTGGA TCAAATCTGT CGATCCTTCC CGCCCGGTGC AGTATGAA6G 
CGGCGGAGCC GACACCACGG CCACCCATAT TATTTGCCCG ATGTACGCGC 
GCGTGGATGA AGACCAGCCC TTCCCGGCTG TGCCGAAATG GTCCATCAAA 
AAATGGCTTT C6CTACCTGG AGAGACGCGC CCGCTGATCC TTTGCGAATA 
CGCCCACGCG ATGGGTAACA GTCTTGGCGG TTTCGCTAAA TACTGGCA6G 
CGTTTCGTCA GTATCCCCGT TTACAGGGCG GCTTCGTCTG GGACTGGGTG 
GATCAGTCGC TGATTAAATAnTGATGAAAAC GGCAACCCGT GGTCGGCTTA 
CGGCGGTGAT TTTGGCGATA CGCCGAACGA TCGCCAGTTC TGTATGAACG 
GTCTGGTCTT TGCCGACCGC ACGCCGCATC CAGCGCTGAC GGAAGCAAAA 
CACCAGCAGC AGTTTTTCCA GTTCCGTTTA TCCGGGCAAA CCATCGAAGT 
GACCAGCGAA TACCTGTTCC GTCATAGCGA TAACGAGCTC CTGCACTGGA 
TGGTG6CGCT 6GATGGTAAG CCGCTGGCAA GCGGTGAAGT GCCTCTGGAT 
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FIGURE 5D 



GTC6CTCCAC 


AAG6TAAACA 


GTTGATTGAA 


CTGCCTGAAC 


TACCGCAGCC 


3950 


GGAdAGCGCC 


GGGCAACTCT 


GGCTCACAGT 


ACGCGTA6TG 


CAACCGAACG 


4000 


CGACCGCATG 


GTCAGAAGCC 


GGGCACATCA 


GCGCCTGGCA 


GCAGTGGCGT 


4050 


CTGGCGGAAA 


ACCTCAGTGT 


GACGCTCCCC 


GCPGCGTCCC 


ACGCCATCCC 


4100 


GCATCT6ACC 


ACCAGCGAAA 


TG6ATTTTTG 


CATCGAGCT6 


GGTAATAAGC 


4150 


GTTGGCAATT TAXCCGCCA6 TCA6GCTTTC TTTCACA6AT GTGGATTGGC 


4200 


GATAAAAAAC AACTGCTGAC GCCGCTGCGC GATCAGTTCA CCCGTGCACC 


4250 


GCTGGATAAC 


GACATTGGCC 


X A/IW X w AAW W 


wAwwwwWaX X 


wAwWWXAAWW 


4300 


CCTGGGTCGA 


ACG CTGG AAC 

A www A WWAAW 


WWWWWwwwWw 


AX X AwwAwwW 


wwAAwWAwww 


4350 


fPPGTTGCA.GT 


GCACGGCACA 

w w WW w^n 


X A WAW X X WW X 


WAXA9WWW XAaw 


KTT A PC Af? 

X WAX XAWwAW 


4400 


www A wAWw WW 


A WO WAW WA X W 


AwwwwaAAaw 




aw^wwwAAaa 


4450 


VW A AW W X 


1 wAl. WW X AW X 


GGTCAAATGG 


CGATTACCGT 


TwATGTTGAA 


4500 




AX AW/%WWwWA 


TCCGGCGCGG 


ATTGGCCTGA 


AC X w U wib w X 


4550 


GGCGCAGGTA 


GCAGAGCGGG 


TAAACTGGCT 


CGGATTAGGG 


CCGCAAGAAA 


4600 


ACTATCCCGA 


CCGCCTTACT 


GCCGCCTGTT 


TTGACCGCT6 


GGATCTGCCA 


4650 


TTGTCAGACA 


TGTATACCCC 


GTACGTCTTC 


CCGAGCGAAA 


ACGGTCTG06 


4700 


CTGCGGGACG 


CGCGAATTGA 


ATTATGGCCC 


ACACCAGTGG 


CGCGGCGACT 


4750 


TCCAGTTCAA 


CATCAGCCGC 


TACAGTCAAC 


AGCAACTGAT 


GGAAACCAGC 


4800 


CATCGCCATC 


TGCTGCACGC 


GGAAGAAGGC 


ACATGGCTGA 


ATATCGACGG 


4850 


TTTCCATATG 


6GGATT6GTG 


6CGACGACTC 


CTGGAGCCCG 


TCAGTATCGG 


4900 


CGGAATTACA 


GCTGAGCGCC 


6GTCGCTACC 


ATTACCA6TT 


G6TCTG6TGT 


4950 


CAAAAATAAT 


AATAACCGGG 


CAGGCCATGT 


CTGCCCGTAT 


TTCGCGTAA6 


5000 


GAAATCCATT 


ATGTACTATT 


TAAAAAACAC 


AAACTTTTGG 


ATGTTCGGTT 


5050 


TATTCTTTTT 


CTTTTACTTT 


TTTATCATGG 


GAGCCTACTT 


CCCGTTTTTC 


5100 


CCGATTTGGC 


TACATGACAT 


CAACCATATC AGCAAAAGTG 


ATACGGGTAT 


5150 


TATTTTTGCC 


GCTATTTCTC 


TGTTCTCGCT 


ATTATTCCAA 


CCGCTGTTT6 


5200 


GTCTGCTTTC 


TGACAAACTC 


GGCCTCGACT 


CTAGGCGGCC 


GCGGGGATCC 


5250 
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FIGURE 5E 

AGXCATGATA AGATACATTG AT6AGTTTGG ACAAACCACA ACTAGAATGC 5300 

A6TGAAAAAA ATGCTTTATT TGTGAAATTT GTGATGCTAT TGCTTTATTT 5350 

GTAACCATTA TAAGCTGCAA TAAACAAGTT AACAACAACA ATTGCATTCA 5400 

TTTTATGTTT CAGGTTCAGG GGGAGGTGTG GGAGGTTTTT TCGGATCCTC 5450 

TAGAGTCGAC GACGCGAGGC TGGATGGCCT TCCCCATTAT GATTCTTCTC 5500 

GCTTCCG6CG GCATCGGGAT 6CCCGC6TTG CAGGCCATGC TGTCCAGGCA 5550 

6GTAGATGAC GACCATCAGG GACAGCTTCA AGGATCGCTC GCGGCTCTTA 5600 

CCAGCCTAAC TTCGATCACT CGACCGCT6A TCGTCACGGC GATTTATGCC 5650 

GCCTCGGCGA GCACATGGAA CGGGTTGGCA TGGATTGTAG GCGCCGCCCT 5700 

ATACCTTGTC TGCCTCCCCG CGTTGCGTCG CGGTGCATGG AGCCGGGCCA 5750 

CCTCGACCT6 AATGGAAGCC GGCGCCACCT CGCTAACGGA TTCACCACTC 5800 

CAA6AATTG6 A6CCAATCAA TTCTTGCGGA GAACTGTGAA TGCGCAAACC 5850 

AACCCTTGGC AGAACATATC CATCGCGTCC GCCATCTCCA GCAGCCGCAC 5900 

GCGGCGCATC TCGGGCAGCG TTGGGTCCTG GCCACGGGTG CGCATGATC6 5950 

TCCTCCTGTC GTTGAGGACC CGGCTAGGCT GGCGGGGTTG CCTTACTGGT 6000 

TAGCAGAAT6 AATCACCGAT ACGCGA60GA ACGTGAAGCG ACTGCTGCTG 6050 

CAAAACGTCT 6CGACCTGAG CAACAACATG AATGGTCTTC GGTTTCCGT6 6100 

TTTCGTAAAG TCTGGAAACG CGGAAGTCAG CGCCCTGCAC CATTATGTTC 6150 

CGGATCTGCA TCGCAGGATG CTGCTGGCTA CCCTGTGGAA CACCTACATC 6200 

TGTATTAACG AAGCCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 6250 

GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 6300 

GCCCGACC6C TGCGCCTTAT CCGGTAACTA TCGTCTTGA6 TCCAACCCGG 6350 

TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA CAGGATTAGC 6400 

AGAGCGAGGT ATGTAGGCGG TGCTACAGA6 TTCTTGAAGT GGTGGCCTAA 6450 

CTACGGCTAC ACTAGAAGGA CAGTATTTGG TATCTGCGCT CTGCTGAAGC 6500 

CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG CAAACAAACC 6550 
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FIGURE 5F 



ACCGCTGGTA 


GCGGTGGTTT 


TTTTGTTTGC 


AA6CAGCAGA 


TTACGCGCA6 


6600 


AAAAAAAGGA 


TCTCAAGAAG 


ATCCTTTGAT 


CTTTTCTAC6 


GGGTCTGAC6 


6650 


CTCAGTGGAA 


CGAAAACTCA 


CGTTAAGGGA 


TTTTGGTCAT 


GAGATTATCA 


6700 


AAAAGGATCT 


TCACCTAGAT 


CCTTTTAAAT 


TA\AAATGAA 


GTTTTAAATC 


6750 


AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC 


CAATGCTTAA 


6800 


TCAGTGAGGC 


ACCTATCTCA 


GCGATCTGTC 


TATTTCGTTC 


ATCCATAGTT 


6850 


GCCTGACTCC 


CCGTCGTGTA 


GATAACTAC6 


ATACGG6AGG 


GCTTACCATC 


6900 


TGGCCCCAGT 


GCTGCAATGA 


TACCGCGAGA 


CCCACGCTCA 


CCGGCTCCAG 


6950 


ATTTATCAGC 


AATAAACCAG 


CCAGCCGGAA 


GGGCCGAGC6 


CAGAAGTG6T 


7000 


CCTGCAACTT 


TATCCGCCTC 


CATCCAGTCT 


ATTAATTGTT 


GCCGGGAAGC 


7050 


TAGAGTAAGT 


AGTTCGCCA6 


TTAATAGTTT 


GCGCAACGTT 


GTTGCCATT6 


7100 


CTGCAG6CAT 


CGTGGT6TCA 


CGCTCGTCGT 


TTGGTATGGC 


TTCATTCAGC 


7150 


TCCGGTTCCC 


AACGATCAAG 


GCGAGTTACA 


TCATCCCCCA 


TGTTGTGCAA 


7200 


AAAAGCGGTT 


AGCTCCTTC6 


GTCCTCCGAT 


CGTTGTCA6A 


AGTAAGTTG6 


7250 


CCGCAGTGTT ATCACTCATG 6TTATGCCAG CACTGCATAA TTCTCTTACT 


7300 


GTCATGCCAt 


CCGTAAGATG 


CTTTTCTGTG 


ACTGGTGAGT 


ACTCAACCAA 


7350 


GTCATTCTGA 


GAATAGTGTA 


TGCGGCGACC 


GAGTTGCTCT 


TGCCCGGCGT 


7400 


CAACACGGGA 


TAATACCGCG 


CCACATAGCA 


CAACTTTAAA 


AGTGCTCATC 


7450 


ATTGGAAAAC 


GTTCTTCGG6 


GCGAAAACTC 


TCAAGGATCT 


TACCGCTGTT 


7500 


6A6ATCCAGT 


TCGATGTAAC 


CCACTCGTGC 


ACCCAACTGA 


TCTTCAGCAT 


7550 


CTTTTACTTT 


CACCAGCGTT 


TCTGGGTGAG 


CAAAAACAGG 


AAGGCAAAAT 


7600 


6CCGCAAAAA 


AGGGAATAA6 


GGCGACACGG 


AAATGTTGAA 


TACTCATACT 


7650 


CTTCCTTTTT 


CAATATTATT 


GAAGCATTTA 


TCAGGGTTAT 


TGTCTCAT6A 


7700 


GCGGATACAT 


ATTTGAATGT 


ATTTAGAAAA 


ATAAACAAAT 


AGGGGTTCC6 


7750 


CGCACATTTC 


CCCGAAAAGT 


GCCACCTGAC 


6TCTAAGAAA 


CCATTATTAT 


7800 


CATGACATTA 


ACCTATAAAA 


ATAGGCGTAT 


CACGAGGCCC 


TTTCGTCTTC 


7850 


AA 
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FIGURE 7A 



TCTTCCGCTT 


CCTCGCTCAC 


TGACTCGCTG 


CGCTCGGTCG 


TTCGGCTGC6 


50 


GCGAGCGGTA 


TCAGCTCACT 


CAAAGGCGGT 


AATACGGTTA 


TCCACAGAAT 


100 


CAGGGGATAA 


CGCAGGAAAG 


AACATGTGAG 


CAAAAGGCCA 


GCAAAA6GCC 


150 


AGGAACCGTA 


AAAAGGCCGC 


GTTGCTGGCG 


TTTTTCCATA 


GGCTCCGCCC 


200 


CCCTGACGAG 


CATCACAAAA 


ATCGACGCTC 


AAGTCAGAGG 


TGGCGAAACC 


250 


CGACAGGACT 


ATAAAGATAC 


CAGGCGTTTC 


CCCCTGGAA6 


CTCCCTCGT6 


300 


CGCTCTCCTG 


TTCCGACCCT 


GCCGCTTACC 


GGATACCTGT 


CCGCCTTTCT 


350 


CCCTTCGGGA 


AGCGTGGCGC 


TTTCTCATAG 


CTCACGCTGT 


AGGTATCTCA 


400 


GTTCGGTGTA 


GGTCGTTCGC 


TCCAA6CTGG 


GCTGTGTGCA 


CGAACCCCCC 


450 


GTTCAGCCCG 


ACCGCTGCGC 


CTTATCCGGT 


AACTATCGTC 


TTGAGTCCAA 


500 


CCCGGTAAGA 


CACGACTTAT 


CGCCACTGGC 


AGCAGCCACT 


GGTAACAGGA 


550 


TTA6CAGA6C 


GAGGTATGTA 


GGCGGTGCTA 


CAGAGTTCTT 


GAAGTGGTGG 


600 


CCTAACTACG 


6CTACACTAG 


AAGAACAGTA 


TTTGGTATCT 


GCGCTCTGCT 


650 


GAAGCCAGTT 


ACCTTCGGAA 


AAAGAGTTGG 


TAGCTCTTGA 


TCCGGCAAAC 


700 


AAACCACCGC 


TGGTAGCGGT 


GGTTTTTTTG 


TTTGCAAGCA 


GCAGATTACG 


750 


CGCAGAAAAA 


AAGGATCTCA 


AGAAGATCCT 


TTGATCTTTT 


CTACGGGGTC 


800 


TGACGCTCAG 


TGGAACGAAA 


ACTCACGTTA 


AGGGATTTTG 


GTCATGAGAT 


850 


TATCAAAAA6 


GATCTTCACC 


TAGATCCTTT^ 


TAAATTAAAA 


ATGAAGTTTT 


900 








TGGTCTGACA 


GTTACCAATG 


950 


CTTAATCAGT 


GAGGCACCTA 


TCTCAGCGAT 


CTGTCTATTT 


CGTTCATCCA 


1000 


TAGTT6CCTG 


ACTCCCCGTC 


GTGTAGATAA 


CTACGATACG 


GGAGGGCTTA 


1050 


CCATCTGGCC 


CCAGTGCTGC 


AATGATACCG 


CCAGACCCAC 


GCTCACCG6C 


1100 


TCCAGATTTA 


TCAGCAATAA 


ACCAGCCAGC 


CGGAAGGGCC 


GAGCGCAGAA 


1150 


GTGGTCCTGC 


AACTTTATCC 


GCCTCCATCC 


AGTCTATTAA 


TTGTT6CC6G 


1200 


GAAGCTAGAG 


TAAGTAGTTC 


GCCAGTTAAT 


AGTTTGCGCA 


ACGTTGTTGC 


1250 


CATTGCTACA 


GGCATCGTGG 


TGTCACGCTC 


GTCGTTTGGT 


ATGGCTTCAT 


1300 
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TCAGCTCCGC TTCCCAACGA TCAAGGCGAG TTACATGATC CCCCATGTTG 1350 

TGCyUUUUUlG CGGTTAGCTC CTTCGGTCCT CCGATCGTT6 TCAGAAGTAA 1400 

GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC 1450 

TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA 1500 

ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC 1550 

GGCGTCAATA CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC 1600 

TCATCATTGG AAAACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG 1650 

CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC 1700 

AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC 1750 

AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 1800 

ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT 1850 

CAT6AGCGGA TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGG6G 1900 

TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC CTGACGTCTA AGAAACCATT 1950 

ATTATCATGA CATTAACCTA TAAAAATAGG CGTATCACGA GGCCCTTTC6 2000 

TCTCGCGCGT TTCGGTGATG ACGGTGAAAA CCTCTGACAC ATGCAGCTCC 2050 

CXKSAGACGGT CACAGCTTGT CTGTAAGCG6 ATGCCGGGAG CAGACAAGCC 2100 

CGTCAGGGCG CGTCAGCGGG TGTTGGCGG6 TGTCGGGGCT GGCTTAACTA 2150 

TGCGGCATCA GAGCAGATTG TACTGAGAGT 6CACCATAAA ATPGTAAACG 2200 

TTAATATTTT GTTAAAATTC GCGTTAAATT TTTGTTAAAT CAGCTCATTT 2250 

TTTAACCAAT AGGCCGAAAT CGGCAAAATC CCTTATAAAT CAAAAGAATA 2300 

GCCCGAGATA GGGTTGAGTG TTGTTCCAGT TTGGAACAAG AGTCCACTAT 2350 

TAAAGAACGT GGACTCCAAC GTCAAAGGGC GAAAAACCGT CTATCAGGGC 2400 

GATGGCCCAC TACGTGAACC ATCACCCAAA TCAAGTTTTT TGGGGTCGAG 2450 

GTGCCGTAAA GCACTAAATC GGAACCCTAA AGGGAGCCCC CGATTTAGAG 2500 

CTTGACGGGG AAAGCCGGCG AACGTGGCGA GAAAGGAAGG GAAGAAAGCG 2550 

AAAGGAGCGG GCGCTAGGGC GCTGGCAAGT GTAGCGGTCA CGCTGCGCGT 2600 
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AACCACCACA 


CCCGCCGCGC 


TTAATGCGCC 


GCTACAGGGC 


GCGTACTATG 


2650 


GTTGCTTTGA 


CGTATGCGGT 


GTGAAATACC 


GCACAGATGC 


GTAAGGAGAA 


2700 


AATACC6CAT 


CAGGCGCGAT 


TCGCCATTCA 


GGCTGCGCAA 


CTGTTGGGAA 


2750 


GGCCGATCGG 


TGCGGGCCTC 


TTCGCTATTA 


CGCCAGCTG6 


CGAAAGGGG6 


2800 


ATGTGCTGCA 


AGGCGATTAA 


GTTGGGTAAC 


GCCAGGGTTT 


TCCCAGTCAC 


2850 


GACGTTGTAA 


AACGACGGCC 


AGTGCCAAGC 


TTAAGGT6CA 


CGGCCCACGT 


2900 


GGCCACTAGT 


ACTTCTCGAG 


CTCTGTACAT 


GTCCGCGiSTC 


GCGACGTACG 


2950 


CGTATCGATG 


GCGCCAGCTG 


CAGGCGGCCG 


CCATATGCAT 


CCTAGGCCTA 


3000 


TTAATATTCC 


GGAGTATACG 


TAGCCGGCTA 


ACGTTAACAA 


CCGGTACCTC 


3050 


TAGAACTATA 


GCTAGCCAAT 


TCCATCATCA 


ATAATATACC 


TTATTTTGGA 


3100 


TTGAAGCCAA 


TATGATAAT6 


AGGGGGTGGA 


GTTTGTGACG 


TGGCGCGG66 


3150 


CGTGGGAAC6 


GGGCGGGT6A 


CGTAGGTTTT 


AGGGCGGAGT 


AACTTGTAT6 


3200 


TGTTGG6AAT 


TGTAGTTTTC 


TTAAAATGGG 


AAGTTAC6TA 


ACGT6GGAAA 


3250 


ACGGAAGTGA 


CGATTTGAGG 


AAGTTGTGGG 


TTTTTTGGCT 


TTCGTTTCTC 


3300 


GGCGTAGGTT 


CGCGTGCGGT 


TTTCTGGGTG 


TTTTTTGTGG 


ACTTTAACCG 


3350 


TTACGTCATT 


TTTTAGTCCT ATATATACTC 


GCTCTGCACT 


TGGCCCTTTT 


3400 


TTACACTGTG 


ACTGATTGAG 


CTGGT6CCGT 


GTCGAGTGGT 


GTTTTTTTAA 


3450 


TAGGTTTTCT 


TTTTTACTGG 


TAAGGCTGAC 


TGTTAGGCT6 


CCGCTGTGAA 


3500 




TTGTTCTGGA 


GCGGGAGGGT 


GCTATTTTGC 


CTAGGCAGGA 


3550 


GGGTTTTTCA 


GGTGTTTATG 


TGTTTTTCTC 


TCCTATTAAT 


TTTGTTATAC 


3600 


CTCCTAT6G6 


GGCTGTAATG 


TTGTCTCTAC 


GCCTGCGGGT 


ATGTATTCCC 


3650 


CCCAAGCTT6 


CATGCCTGOi 


GGTCGACTCT 


AGAGGATCC6 


AAAAAACCTC 


3700 


CCACACCTCC 


CCCTGAACCT 


GAAACATAAA 


ATGAATGCAA 


TTGiTTGTTGT 


3750 


TAACTTGTTT 


ATTGCAGCTT 


ATAATGGTTA 


CAAATAAAGC 


AATAGCATCA 


3800 


CAAATTTCAC 


AAATAAAGCA 


'rmrri'CAC 


TGCATTCTAG 


TTGTGGTTTG 


3850 


TCGAAACTCA 


TCAATGTATC 


TTATCATGTC 


TGGATCCCCC 


TAGCTTGCCA 


3900 
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AACCTACAGG 
AATCTTTTAT 
ATTGTTGAGT 
AATTGGACTA 
CAACATTTAT 
TCTCCTCTTT 
TTCCGGTGG6 
GAAGAGGCTC 
CTTTGTTCTC 
ATCCTGTGTT 
AGTTCTTCTA 
CATCAAGCA6 
ATCAACTGCT 
AAAGTCAAGC 
CATCTGCAAC 
AAGTTTTTTC 
TCCAAAGGCT 
CATCGATCTG 
AACAAAGTAC 
GCCAGGACTT 
CTGTGTATTT 
CAGATGTCAT 
GAGTTGGCCA 
TTGGCATGTC 
TCCACATCTA 
CATGGCTAAA 



TGGGGTCTTT 
TTTATCTATG 
CAAAACTAGA 
ATCGCGGGAT 
GCTGCTCTCT 
CAGAGCAGCA 
GAAAGAGCTT 
CTCTCGTTCA 
TTCTATGACC 
CACAGAGAAT 
ATTATTTGGT 
CAAGATCTTC 
TGTGGCCATG 
TTCCCAGGAA 
TTTCCATATT 
TAAATGTTCC 
TTCCTCCACT 
GATTTCTCCT 
TCTTCCCTGA 
ATTGAGAAGG 
TGCTGTGAGA 
CTTTCTTCAC 
TTCTTGTATG 
AATGAACTTA 
TGCTGGAGTT 
GTCAGGATAA 



CATTCCCCCC 
GCTCGTACTC 
GCCTGGACCA 
CAGCCAATTC 
AAAGCCTTGT 
ATCTGGGGCT 
CACCCTGTCG 
GCAGTTTCTG 
AAAAATTGTT 
TACTGTGCakA 
ATGTTACTG6 
GCCTTACTGA 
GCTTAGGACA 
ACTGTTCTAT 
TCTTGATCAC 
AGAAAAAATA 
GTTGCAAAGT 
TCAGTGTTCA 
TCCAGTTCTT 
AAATGTTCTC 
TCTTTGACAG 
GTGTGAATTC 
GTTTGGTTGA 
AAGACTCGGC 
TACAGCCCAC 
TACCAACTCT 



TTTTTCTGGA 
TATAGGCTTC 
CTGATATCCT 
CATGAGCAAA 
ATCTTGCATC 
TAGACTTGCA 
GAGGGGCTGA 
GATGGAATCG 
GGCATTCCAG 
TCAGCAAATG 
ATCCAAATGA 
GAACAGATCT 
CAGCCCCCAT 
CACAGATCTG 
TCCACTGTTC 
AATACTTTCT 
TATTGAATCC 
GTAGTCTCAA 
CCCAAGAGGC 
TAATATGGCA 
TCATTTGGCC 
TCAATAATCA 
CTTGGTAGGT 
TCACAGATCG 
TGCAATGTAC 
TCCTTCTCCT 



GACTAAATAA 
AGCTGGTGAT 
GTCTTTAACA 
TGTCCCATGT 
TCTTCTTCTG 
CTTGCTTGAG 
TGGCTTGCCG 
TACTGCCGCA 
CATTGCTTCT 
CTTGTTTTAG 
GCACTGGGTT 
AGCCAAGCAC 
CCACAAGGAC 
AGCCCAACCT 
ATAGGGATCC 
GTGGTATCAC 
CAAGACACAC 
AAAAGCTGAT 
CCACCCTCTG 
TTTCCACCTT 
CCCTGAGGGC 
TAACTTTC6A 
TTACCTTCTG 
CATCAAGCTA 
TCATGATATT 
TCTCCTGTTG 



3950 

4000 

4050 

4100 

4150 

4200 

4250 

4300 

4350 

4400 

4450 

4500 

4550 

4600 

4650 

4700 

4750 

4800 

4850 

4900 

4950 
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5050 

5100 

5150 

5200 
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TTAAAATGGA 


AATGAAGGTA 


ACAGCAATGA 


AGAAGAT6AC 


AAAAATCATT 


5250 


TCTATTCTCA 


TTTGGAACCA 


GCGCAGTGTT 


GACAGGTACA 


AGAACCAGTT 


5300 


GGCAGTATGT 


AAATTCAGA6 


CTTTGT6GAA 


CAGAGTTTCA 


AAGTAAGGCT 


5350 


6CCGTCCGAA 


GGCACGAAGT 


GTCCATAGTC 


CTTTTAA6CT 


TGTAACAA6A 


5400 


TGAGT6AAAA 


TTGGACTCCT 


GCCTTCAGAT 


TCCAGTTGTT 


TGAGTTGCTG 


5450 


T6AGGTTTG6 


AGGAAATATG 


CTCTCAACAT 


AATAAAAGCC 


ACTATCACTG 


5500 


6CACTGTTGC 


AACAAAGATG 


TAGGGTTGTA 


AAACTGCGAC 


AACTGCTATA 


5550 


GCTCCAATCA 


CAATTAATAA 


CAACTGGAT6 


AAGTCAAATA 


TGGTAA6AG6 


5600 


CAGAAGGTCA 


TCCAAAATTG 


CTATATCTTT 


GGAGAATCTA 


TTAAGAATCC 


5650 


CACCTGCTTT 


cAacgtgttg 


AGGGTTGACA 


TAGGTGCTT6 


AAGAACAGAA 


5700 


TGTAACATTT 


TGTGGTGTAA 


AATTTTCGAC 


ACTGTGATTA 


GAGTATGCAC 


5750 


CAGTGGTAGA CCTCTGAAGA 


ATCCCATAGC 


AAGCAAAGTG 


TC6GCTACTC 


5800 


CCACGTAAAT 


GTAAAACACA 


TAATACXSAAC 


TGGTGCTGGT 


GATAATCACT 


5850 


GCATAGCTGT 


TATTTCTACT 


ATGAGTACTA 


TTCCCTTTGT 


CTTGAAGAGG 


5900 


AGTGTTTCCA 


AGGAGCCACA 


GCACAACCAA 


AGAAGCAGCC 


ACCTCT6CCA 


5950 


GAAAAATTAC 


TAAGCACCAA 


ATTAGCACAA 


AAATTAAGCT 


CTTGTGGACA 


6000 


GTAATATATC 


GAAGGTATGT 


GTTCCATGTA 


GTCACTGCTG 


GTATGCTCTC 


6050 


CATATCATCA 


AAAAAGCACT 


CCTTTAAGTC 


TTCTTCGTTA 


ATTTCTTCAC 


6100 


TTATTTCCAA 


GCCAGTTTCT 




rpmprpm/j « ATA 


TATATCCAGT 


6150 


TCAGTCAAGT 


TTGCCTGAGG 


GGCCAGT6AC 


ACTTTTCGTG 


TGGATGCTGT 


6200 


TGTCTTTCGG 


TGAATGTTCT 


GACCTTGGTT 


AACT6AGTGT 


GTCATCAGGT 


6250 


TCAGGACAGA 


CTGCCTCCTT 


CGTGCCTGAA 


GCGTGGGGCC 


AGTGCTGATC 


6300 


ACGCTGATGC 


GAGGCAGTAT 


CGCCTCTCCC 


TGCTCAGAAT 


CTGGTACTAA 


6350 


GGACAGCCTT 


CTCTCTAAAG 


GCTCATCAGA 


ATCCTCTTCG 


ATGCCATTCA 


6400 


TTTGTAAGGG 


AGTCTTTTGC 


ACAATGGAAA 


ATTTTCGTAT 


AGAGTTGATT 


6450 


GGATTGAGAA 


TAGAATTCTT 


CCTTTTTTCC 


CCAAACTCTC 


CAGTCTGTTT 


6500 
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AAAAGATTGT 
ATGAGAAACG 
CTAAATTGGT 
CTGTAGATTT 
CATGCAAAAT 
GTGACCAAAA 
AAATATTTCT 
ATAAATACAA 
GCTCGTTGAC 
GTCTTTCTCT 
TGACGCTTCT 
TCTTTAATGG 
TCTTCCACTG 
TCATCATTAG 
AACAACTGTC 
ACCAAGAAGT 
TTTTTCTATT 
CCCTCCTCCC 
TAAGTTATAT 
TTTTGTTTAT 
AATTGCCGAG 
GAATATTTTC 
ATAAAAACAC 
TATCTCACAT 
GTTTTCAATC 
ATTGGATGTT 



TTTTTTGTTT 
GTGTAAGGTC 
CGAAAGAATC 
TGGAGTTCTG 
TAATATTTTG 
TCCTAGTTTT 
TTTTCTGTTA 
ATCAGCATCT 
CTCCACTCAG 
GCAAACTTGG 
GTATCTATAT 
TGCCAGGCAT 
TGCTTAATTT 
AAGTGAAGTC 
CTCTTTCTAT 
GAGAAATTAC 
GTTATTGTTT 
AGAAGGCTGT 
TCCAATGTCT 
TGCTCCAAGA 
TGACCGCCAT 
CGGAGGATGA 
CACAAAGAAC 
AGGCTGCCTT 
ATTTTTTCCA 
CTCGATCATT 



CTGTCCAGGA 
TCAGTTAGGA 
ACATCCCATG 
AAAATGTCCC 
TCAGCTTTCT 
GTTAGCCATC 
AAACATCTAG 
TTGTATACTG 
TGTGATTCCA 
AGATGTCCTC 
TCATCATAGG 
AATCCAGGAA 
TACCCTCTGA 
TTGCCTGCTC 
CTTGAAATTA 
TGAAGAAGAG 
TGTTTTGCTT 
TACATTCTCC 
TATATTCTTG 
GAGTCATACC 
GCGCAGAACA 
TTCCTTTGAT 
CCTGAGAAGA 
CCGAGTCAGT 
TTGCTTCTTC 
TCTGAGGTAA 



GACAGGAGCA 
TTGAATTTCT 
AGTTTTGAGC 
ATAAAAATAG 
TTAAATGTTC 
AGTTTACAGA 
GTATCCAAAA 
CTCTTGCTAA 
CCTTCTCCAA 
TTCTAGTTGG 
AAACACCAAA 
AACTGAGAAC 
AGGCTCCAGT 
CAGTGGATCC 
ATATCTTTCA 
GCTGTCATCA 
TCTCAAATAA 
ATCACTACTT 
CTTTTGTAAG 
ATGTTTGTAC 
ATGCAGAATG 
TAGTGCATAG 
AGAAGGCTGA 
TTCAGTTCTG 
CCAGCAGTAT 
TCACAAGTCT 



TCTCCTTCTA 
TCTTTCTGCA 
TAAAGTCTGG 
CTGCTACCTT 
CATTTTAGAA 
CACAGCTTTC 
GGAGAGTCTA 
AGAAATTCTT 
GAACTATATT 
CATGCTTTGA 
GATGATATTT 
AGAATGAAAT 
TCTCCCATAA 
AGCAACCGCC 
GGACAGGA6T 
CCATTAGAAG 
TTCCCCAAAT 
CTGTAGTC6T 
AAATCCTGTA 
AGCCCAGGGA 
AGATGGTGGT 
GGAAGCACAG 
GCTATTGAAG 
TTTGTCTTAA 
GCCTTAACAG 
TTCACTGATC 



6550 

6600 

6650 

6700 

6750 

6800 

6850 

6900 

6950 

7000 

7050 

7100 

7150 

7200 

7250 

7300 
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7400 
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7500 

7550 
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7650 

7700 

7750 

7800 



SUBSnnilESHEEr flfflLESB) 



W09fi/13S97 



PCTA)S9S/14017 



23/45 
FIGURE 76 



TTCCCAGCTC 


TCTGATCTCT 


GTACTTCATC 


ATCATTCTCC 


CTAGCCCAGC 


7850 


CTGAAAAAG6 


6CAAGGACTA 


TCAGGAAACC 


AA6TCCACAG 


AAGGCAGACG 


7900 


CCTGTAACAA 


CTCCCAGATT 


AGCCCCAT6A 


66AGTGCCAC 


TTGCAAAGGA 


7950 


GCGATCCACA 


CGAAATGTGC 


CAATGCAAGT 


CCTTCATCAA 


ATTTGTTCA6 


8000 


GTTGTTGGAA 


AGGAGACTAA 


CAAGTTGTCC 


aatacttatt 


TTATCTAGAA 


8050 


CAC6GCTTGA 


CAGCTTTAAA 


GTCTTCTTAT 


AAATCAAACT 


AAACATAGCT 


8100 


ATTCTCATCT 


GCATTCCAAT 


GTGATGAA6G 


CCAAAAATG6 


CTGGGTGTA6 


8150 


GAGCAGTGTC 


CTCACAATAA 


AGAGAAGGCA 


TAAGCCTATG 


CCTAGATAAA 


8200 


TCGCGATAGA 


GCGTTCCTCC 


TTGTTATCCG 


GGTCATAGGA 


AGCTATGATT 


8250 


CTTCCCAGTA 


AGAGAGGCTG 


TACTGCTTTG 


GTGACTTCCC 


CTAAATATAA 


8300 


AAAGATTCCA 


TAGAACATAA 


ATCTCCAGAA 


AAAACATC6C 


CGAAGGGCAT 


8350 


TAATGAGTTT. 


A6GATTTTTC 


TTT6AAGCCA 


6CTCTCTATC 


CCATTCTCTT 


8400 


TCCAATTTTT 


CAGATAGATT 


GTCA6CAGAA 


TCAACAGAAG 


GGATTTGGTA 


8450 


TATGTCTGAC 


AATTCCAGGC 


GCTGTCTGTA 


TCCTTTCCTC 


AAAATTGGTC 


8500 


TGGTCCAGCrr 


GAAAAAAAGT 


TTGGAGACAA 


CGCTGGCCTT 


TTCCAGAGGC 


8550 


GACCTCTGCA 


TGGTCTCTC6 


GGCGCTGGGG 


TCCCTGCTA6 


6GCCGTCTGG 


8600 


GCTCAAGCtC 


CTAAT6CCAA 


A6GAATTCCT 


GCAGCCCGGG 


GGATCCACTA 


8650 


GTTCTA6A6C 


GGCCGCCACC 


6C6GTGGCTG 


ATCCCGCTCC 


CGCCCGCCGC 


8700 


oWV9 w 1. ± Www X 


XXX X AX A\9VV 






CCATAAAAG6 


8750 


AAACTTTCGG 


AGCGCGCCGC 


TCTGATTGGC 


TGCCGCCGCA 


CCTCTCCGCC 


8800 


TCGCCCCGCC 


CCGCCCCTC6 


CCCCGCCCCG 


CCCCGCCTGG 


CGCGCGCCCC 


8850 


CCCCCCCCCC 


CCGCCCCCAT 


CGCTGCACAA 


AATAATTAAA 


AAATAAATAA 


8900 


ATACAAAATT 


GGGGGTGGGG 


AGGGGGGGGA 


GATGGGGAGA 


GTGAAGCAGA 


8950 


ACGTGGCCTC 


GAGTAGATGT 


ACTGCCAAGT 


AGGAAAGTCC 


CATAAGGTCA 


9000 


TGTACTGGGC 


ATAATGCCAG 


GCGGGCCATT 


TACCGTCATT 


GACGTCAATA 


9050 


GGGGGCGTAC 


TTGGCATAT6 


ATACACTT6A 


TGTACTGCCA 


AGTGGGCA6T 


9100 
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TTACCGTAAA TACTCCACCC ATTGACGTCA ATGGAAAGTC CCTATTGGC6 9150 

TTACTATGGG AACATACGTC ATTATTGACG TCAATGGGCG GGGGTCGTTG 9200 

GGCGGTCAGC CAGGCGGGCC ATTTACCGTA AGTTATGTAA CGACCTGCAG 9250 

GCTGATCTCC CTAGACAAAT ATTACGCGCT ATGAGTAACA CAAAATTATT 9300 

CAGATTTCAC TTCCTCTTAT TCAGTTTTCC CGCGAAAATG GCCAAATCTT 9350 

ACTCGGTTAC 6CCCAAATTT ACTACAACAT CCGCCTAAAA CCGCGCGAAA 9400 

ATTGTCACTT CCTGTGTACA CCGGCGCACA CCAAAAACGT CACTTTTGCC 9450 

ACATCCGTCG CTTACATGTG TTCOGCCACA CTTGCAACAT CACACTTCCG 9500 

CCACACTACT ACGTCACCCG CCCCGTTCCC ACGCCCCGCG CCACGTCACA 9550 

AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA TAAGGTATAT 9600 

TATTGATGAT GCTAGCATGC GCAAATTTAA AGCGCTGATA TCGATCGCGC 9650 

GCAGATCTGT CATGATGATC ATTGCSATTG GATCCATATA TAGGGCCCGG 9700 

GTTATAATTA CCTCAGGTCG ACGTCCCATG GCCATTCGAA TTCGTAATCA 9750 

TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACai 9800 

CAACATACGA GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG 9850 

TGAGCTAACT CACATTAATT GCGTTGCGCT CACTGCCCGC TTTCCAGTC6 9900 

GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GC6CGGGGAG 9950 

QQ72 

AGGCGGTTTG CGTATTGGGC GC 
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FIGURE 12A 

CCAATTCCAT CATCAATAAT ATACCTTATT TTGGATTGAA GCCAATATGA 
TAATGAGGGG GTGGAGTTTG TGACGTGGC6 CGGGGCGTG6 GAACGGGGCG 
GGTGACGTAG GTTTTAGGGC GGAGTAACTT GTATGTGTT6 GGAATTGTA6 
TTTTCTTAAA ATGGGAAGTT ACGTAACGTG GGAAAACGGA AGTGACGATT 
TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT TTCTGGGCGT AGGTTCGCGT 
GCGGTTTTCT GGGTGTTTTT TGTGGACTTT AACCGTTAC6 TCATTTTTTA 
GTCCTATATA TACTCGCTCT GCACTTGGCC CTTTTTTACA CTGTGACTGA 
TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT TTTAATAGGT TTTCTTTTTT 
ACTGGTAAGG CTGACTGTTA GGCTGCCGCT GTGAAGCGCT GTATGTTGTT 
CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG CAGGAGGGTT TTTCAGGTGT 
TTATGTGTTT TTCTCTCCTA TTAATTTTGT TATACCTCCT ATGGGGGCT6 
TAATGTTGTC TCTACGCCTG CGGGTATGTA TTCCCCCCAA GCTT6CATGC 
CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA ACCTCCCACA CCTCCCCCTG 
AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT TGTTTATTGC 
AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA 
AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA ACTCATCAAT 
GTATCTTATC ATGTCTGGAT CCCCGCGGCC GCTPCTAGAAC TAGTGGATCC 
CCCGGGCTGC AGGAATTCCG TAACATAACT GCGTGCTTTA TTGAGATACA 
CAGTAAAGCA GTAATATAAT ACAATAGTAA GCCATATATT TGGTGAAATC 
TGATATGTTG TGAAAATGCA GTAAAACTGA AGTTTAAAAA AATAATTAGT 
AAATGTTACA GTGTTGGTGT TAAAACACAA TCTATTATGA TACTCAAGTA 
AGAGTCCAGT ACCTGGAGAC AATGATGATA CATGCCATGT GATGATTATG 
CTTCAGTTAC ACTGATTATG ATTTACACTT TAATACTTGA TGGTTATAAA 
GAACATGAAA TGATGTCCAA ATTATGCTTA AAATCAGCAA TAAAGCTCTC 
AGTTTTTATT CAAATATTTT GATAGATTCA CTCCAGAACT AATATCTAAA 
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ATGACTTGCA 


TTTGTTTAGA 


CATTTCAGCC 
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CGTCAACTA6 


ACCAGAGAGT 


TTGGAGACGC 


TTTTGCTCTC 


AAAACTTTCC 


1900 


AACCACTGT6 


CCTTCTCACC 


CACAATCCTG 
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CTTGCAGGGA 
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TGGGCTTCTG 


GACTGATATT 


2000 


GACCA6GGTC 


ACAATGTGAT 


TGGGTTACTT 
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AATCCTAAGT 
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ATGAAGAAAA 


AAAAACCCAA 


2100 
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ATCTAAAATT 


CATCATCATC 


ATCAACAACA 
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TCAGGTTGAG 


TTTATGAAGA 


GGGCAGAACA 
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ATTTAGTTGT 
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ATGTTTATAT 
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AAATATTCAT 
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ATATAAATT6 


ACTGAGGAGC 
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GAGTTAAAAA 
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TGTCATCTGA 


TGTTCTATTG 


CAGAAACATC 


ATTAAATATA 


ACCCAACA6T 


2400 


AGGAAGTTGA 


ATATATCAGC 
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CTATGATAGT 


AAGTCCTGTG 
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CATGTTCCTT 


GAAAAAAATG 


AATCCTCTAG 


CTCTCAGTGG 


2500 
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AAAGTTTAAA ACTAGAAACA TCTGGAGCCC TAGACAATAT TTTAGTGTGG 
CGGTA6TCTC CT6GCTTTGG GCTCCAGGGA AAATTCACTC TTGCCCAAGC 
AGATAAGCCC AGATGACTA6 AA6CAATTTC CATTAGGAAG TGGCAAGAAC 
ATTTGAAGAA GTAACTTCAT ATCTATTTAT CTATATACCT ATAGTATTTA 
TATACTTGTA GACATATAGA T6TATAAAAT GAAAGCCCAT AOCCAGCCCC 
ACTCAGTCAA CAATTCTCAA AAGAGCAATA TGAAGCAGTC ATTTGGTGGG 
GTTCGTATGC AAGAAAATAA AAAAACGTCA TGAATTCCAT ATGAATACCA 
CGCTAAAGTA ATGCAAAACA ATGTGCTGCC TCAGTGTGTG TGTGTGTGTG 
TGTGTGTGT6 GTGGGTTCGT GCATGTATGT GTGCGTGTGT GTGTGTGTGT 
GTGTGTGTGT GTGTGTGTGC GTGTGTGTTT GTTTAGGGGT TTTTATAAAC 
AACTTTTTTT ATAAAGCACA CTTTAGTTTA CAATCTCTCT TTATAACTGT 
TATAAATTTT TAAACAACCC AAAATGCGTT CCATATAAAG AAATGGCAAG 
TTATTTAGCT ATCAAGATTT TACATGTTTT CTTTTAACTT TTTTGTACAA 
TTGCATAGAC GTGTAAAACC TGCCATTGTT AACAJUU^CAA TAACAGACTT 
AGAAACTACT GAAATCTACA GTATAGTACC ACTACCCTTC ACAAAAATAT 
AGATTTTATT TCTTGTAAAC TCTTACTGTC TAATCCTCTT TGTTGTACGA 
ATATTATAAA AACCATGCGG GAATCAGGAG TTGTAAAACA TTTATTCTGC 
TCCTTCTTCA TCTGTCATGA CTGAAACTAA GGACTCCATC GCTTCTGCCCA 
AATCATCTGC CATGTGGAAA AGGCTTCCTA CATTGTGTCC TCTCTCATTG 
GCTTTCCGGG GGCATTTCTT CCTCTTGAAC TAGGGAAGGA GTTGTTGA6T 
TGCTCCATCA CTTCTTCTAA CCCTGTGCTT GTGTCCTGGG GAGGACTCAG 
AAGATCTTCC TCACCCATAG ATTCTGAAGT TTGACTGCCA ACCACTCGGA 
GCAGCATAGG CTGACTGCTA TCTGACCTCT GCAGAGAGGT GGAAGGAGAG 
GACACCGTG6 TGCCATTCAC CTTAGCTTCA GCCTGGGGCT GCTCCAGGAG 
CTGTCTCAGT CTATGTAACT GAGACTCCyiG CTGTTTATTG TGGTCTTCCa 
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GGATTTGCAT 


CCTGGCTTCC 


AGGCGTCCTT 


TGTGTTGGCG 
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ACTCTACCAT 
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ATTTTATGGC 


CCTTTGCAAC 
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TCGGCCAGAA 


AAAAAGCAAC 
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GTCATAATTA 


AAATGCTTTA 
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GGCTTCTGTA 


CCTGAATCCA 


ATGATTGGAC 


ACTCCTTACA 


GATGTTACAC 
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TTGGCTTGAT 


GCTTGGCAGT 


TTCAGCAGCA 


GCCACTCTGT 


GCAAGACGG6 
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CAGCCACACC 


ATAGACT6GG 


GTTCCAGGCG 


CATCCAGTCA 


AGGAAGAGAG 


4650 


CAGCTTCAAT 


CTCAGGTTTA 


TTATTGGCAA 


ATTGGAAGCA 


GCTCCTGACA 




CTCGGCTCAA 


TGTTACTGCC 


CCCAAAGGAA 


GCAACTTCAC 


CCAACTGTCT 
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TGGGATTTGA 


ATAGAATCAT 


GCAGAAGAAG ACCCAGCCTA CGCTGGTCAC 
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AAAAGCCAGT 


TGAACTTGCC 


ACTTGCTTGA 


AAAGGTATCT 


GTACTTGTCT 
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AGAAATGAT6 


CCAGTTTTAA 


AAGACAGGAC 


4900 


ACGGATCCTC 


CCTGTTCGTC 


CCGTATCATA 


AACATTGAGA 


AGCCAGTTGA 
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GACACATATC 


CACACAGAGA 


GGGACATTGA 


CCAGATTGTT 


GTGCTCTTGC 


5000 


TCCAGACGAT 


CATAAATTGT 


AGTCAAACAG 


TTAATTATCT 


GCAGGATATC 


5050 
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FIGURE 12E 

CATGGGCTGG TCATTTTGCT TGAGGTTGTG CTGGTCCAGG GCATCACATG 5100 
CAGCTGACAG GCTCAAGAGA TCCAAGCAAA GGGCCTTCTG GAGCCTTCT6 5150 
AGCTTCATGG CAGTCCTATA CGCGGAGAAC CTGACATTAT TCAGGTCAGC 5200 
TAAAGACTGG TAGAGCTCTG TCATTTTGGG GTGGTCCCAA CAAGTGGTTT 5250 
GGGTCTCGTG GTTGATATAG TAGGGCACTT TGTTTGGTGA GATGGCTCTC 
TCCCAGGGAC CCTGAACTGA AGTGGAAAGG AAGTGCTGGG ATGCAGGACC 
AAAGTCCCTG TGGGCTTCAT GCA6CTGTCT GACACGGTCC TCCACAGCCA 
CCTGTAGAAG CCTCCATCTG GTATTCAGAT CTTCCAAAGT GCTGAGGTTA 
TAAGGTGAGA GCTGAATGCC CAGTGTGGTC AGCTGATGTG CAAGGTCATT 
GACACGATTG ACATTCTCTT TAAGAGGTGC AATTTCTCCC CGAAGTGCCT 
TGACTTTTTC AAGGTGATCT TGCAGAGAGT CAATGAGGAG ATCCCCCACT 
GGCTGCCAGG ATCCCTTGAT CACCTCAGCT TGGCGCAACT TGAGGTCCAG 
TTCATCGGCA GCTTCCTGAA GTTCCTGGAG TCTTTCAAGA GCTTCATCTA 
TTTTTCTCTG CCAATCAGCT GAGCGCAGGT TCAATTTGTC CCATTCAGC6 
TTGACCTCTT CAGCCTGCTT TCGTAGGAGC CGAGTGACAT TCTGAGCTCT 
TTCTTCAGGA GGCAGTTCTC TGGGCTCCTG GTAGAGTTTC TCTAGTCCTT 
CCAAAGGCTG CTCTGTCAGA AATATTCTCA CAGTCTCCAG AGTACTCATG 
ATTACAGGTT CTTTAGTTTT CAATTCCCTC TTGAAGGCCC TATGTATATC 
ATTCTGCTTC TGAACTGCTG GGAAATCACC ACCGATGGGT GCCTGACGGC 
TCAGTTCATC ATCTTTCAGC TGTAGCCAAA CAAGAAGTTC CTGAAGAGAA 
AGATGCAAAC GCTTCCACT6 GTCAGAACTT 6CTTCCAAAT GGGACCTAAT 
GTTGAGAGAC TTTTTCTGAA GTTCACTCCA CTTGAAATTC ATGTTATCCA 
AACGTCTTTG TAACAGGGGT 6CTTCATCC6 AACCTTCCAG GGATCTCAGG 
ATTTTTTGGC CATTTTCATC AAGATTGTGA TAGATATCTG TGTGAGTTTC 
AATTTCTCCT TGGAGATCTT GCCATGGTTT CATCAGCTCT CTGACTCCCC 
TGGAGTCTTC TAGGAGCTTC TCCTTACGGG AAGCGTCCTG TAGGACATTG 
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CAAGTTTATC 
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TTGCTCTTCT GGCCTTATGG GAGCACTTAC AAGTACTGCT CCTCCTGTTT 7650 

CATTTAATTG TTTTAGAATT CCCTGGCGCA GGGGCAACTC TTCTGCCA6T 7700 

AACTTGACTT GTTCAAGTTG TTCTTTTAGC TGCTGCTCAT CTCCAAGTOG 7750 

AGTAATAGCA ATGTTATCTG CTTCTTCCAG CCACAAAACA AATTCATTTA 7800 

AATCTCTTTG AAATTCTGAC AAGACATTCT TTTGTTCTTC AATCCTCTTT 7850 

CTCCTTTCT6 CCAGCTCTTT GCAGATGTCG T6CCACCGCA GACTCAAGCT 7900 

TCCTAATTTT TCTTGTAGAA TATTGACyVTC TGTTTTTGAA GACTGTTGAA 7950 

TTATTTCTTC CCCAGTTGCA TTCAGTGTTC TGACAACAGC TTGAOGCTCC 8000 

CCAATGCCAT CCTGGAGTTC CTTAAGATAC CATTTGTATT TAGCATGTTC 8050 

CCAGTTTTCA GGATTTTGTG TCTTTTTGAA AAACTGTTCA ACTTCATTCA 8100 

GCCATTGATT AAATACCTTC ATATCATAAT GAAAGTGTCG CCATTTTTCA 8150 

ACTGATCTGT CGAATCGCCC TTGTCGTTCC TTGTACATTC TATGAA6TTT 8200 

TTCCCCCTGG AAATCCATCT 6TGCCACGGC TTCCTGTACT TTCACCTTTT 8250 

CCATGGAGGT GGCACTTT6C AAGGCTGCTG TCTTCTTCTT GTGAATAATA 8300 

TCAATCCGAC CTGAGATTTG TTGCAAATTG TCTTTTATAT TCTTAAGAGA 8350 

CTCCTCTTGC TTAAAAAGAT CTTCAAAATC TTTA6CACA6 AGTTCAGGA6 8400 

TATTTAGAAG ATGATCAACT TCTGAAAGAG CTTGTAAGAT ATGACTGATC 8450 

TCCGTCAAAT AAGTAGAAGG CACATAAGAA ACATCCAAAG GCATATCTTC 8500 

AGTCGTCACT ACCATAGTTT CTTCATGGAG AGTGTGAATT TGTGCAAAGT 8550 

TGAGTCTTCG AAACTGAGCA AAATTGCTCT CAATTTGCCG CCAGCGCTTG 8600 

CTGAGCTGGA TCTGAGTTGG CTCCACTGCC ATTGCGGCCC CATTCTCAGA 8650 

CAAGCCCTCA GCTT6CCTGC GCACTGCATT CA6CTCCTCT TTCTTCTTCT 8700 

GCAATTCACG ATCAATTTCC TTTAATTTTC TTTCATCTCT GGGTTCAGGT 8750 

AGGCTGGCTA ATTTTTTTTC AATTTCATCC AAGCATTTCA GGAGATCATC 8800 
AGCCTGCCTC TTGTACTGAT ACCACTGGTG AGAAATTTCT AGGGCCTTTT 
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GGCATCCTTC 
GACTTGTCAA 
GCATCCACCT 
AAGCGACTCC 
GGATACTCTG 
TCCCTCCAAC 
CATGACTCCT 
GGTTCTCCTC 
GTGATTTCCT 
TTCTACTTCA 
CATGCCAACA 
CTGGTGCACA 
TAAGGCCTCT 
TCTCAGTAAG 
GCTCTCTTCA 
TTCAAAATCT 
GCATCTCTGA 
TTTAAGGCTT 
AGTGTTAAGC 
CAGCTTCACT 
GGCTGAATTG 
TTTTTTCAGG 
AAACATCAAC 
AATTTTCGAA 
CAACTGGGAG 



CCCTGGTTAT 
ATCTGATTGG 
TGTCAGTGAT 
TGAATTAAGT 
TTCAAGCAAC 
GAGAATTAAA 
CCATCTGTAA 
TGAATGATGC 
CAGGTCCTGC 
TTGAGCCACT 
T6CCCAAACT 
GCCATTGGTA 
T6TGCTGAGG 
GAGTTTCACT 
TTTCTTCAAC 
CTCTCTAGAT 
TAGATCTTTT 
CCTTTCTGGT 
TCTCTAAGTT 
CTTTATCTTC 
TTTGAATATC 
ATTTCAGCAT 
TTCAGCCATC 
GTTTATTCAT 
GAAAGTTTCT 



GTTTCTTCAT 
ATTTTCTGGG 
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TTTTGTTTCC 
CGTCTCAAGC 
GAGTCTGTGC 
ATCAGATTTT 
AGGAACATTT 
TGTTTGCTTT 
TCTTCCAAAG 
GTTGGTGGTC 
6TGGAGCGTG 
TTAGTTTCTT 
AGCAGTCTGT 
ATTCTTCTTC 
TGGAGGCTTA 
GTAGACCTGG 
CTGTCTCCAG 
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ACCAACTAAA 
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CATTTCTGTA 
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TCCAGTGCCC 



TTCTTCTAAA 
CTTCCTGAG6 
AACTGCTTGT 
TTCAATTTCC 
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CAAGAGATTC 
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CTCTAAATAT 
TTTTGCATTT 
AGAGTTTCAA 
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AATTCATCTG 
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CTCAATCTCT 
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TTCATGTAGT 
TCAGTTCATC 
ATCTGATTTG 
TAGCACTTCA 
TAAGTTTCAA 
GACAATAACT 
TCCATTCAGC 
GTTCCTTTTT 
CTATTTACAG 
CTCTTCTTTA 
GAGTTTTATA 
ATCCACTCAT 
CAAACCTGCC 
GATCCCACTG 
AACTC7AGTT 
ATTTAAACTG 
GTTTGAGCTG 
TCTTTCAGGA 
GTGATTCTGA 
AGCTTTCCAC 
TCAAATTCTG 
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FIGURE 


12J 






ACAGATATTT 


CTGGCATATT 


TCTGAAGGT6 


CTTTCTTGGC 


CATCTCCTTC 


11450 


ACAGTGTCAC 


TCAGATAGTT 


GAAGCCATTT 


TGTTGCTCTT 


TCAAAGAACT 


11500 


TT6CAGA6CC 


TGTAATTTCC 


CGAGTCTCTC 


CTCCATTATT 


TCATATTCA6 


11550 


TAACACTAAG 


ATAAGGTACA 


GAGAGTTTGC 


TTTCTGACTG 


CTGGATCCAC 


11600 


GTCCTGATGC 


TACTCATTGT 


CTCCTGATAG 


CGCATTGGTG 


GTAAAGTGTC 


11650 


AAAAATTGTC 


TGTAGCTCTT 


TCTCTTTGGC 


CCTCACACCA 


TCAAAGAT6T 


11700 


GGTTAAAAT6 


ATTAGTAAAG 


GCCACAAAGT 


CTGCATCCA6 


AAACATTGGC 


11750 


CCCTGTCCCT 


TTTCTTTCAG 


TTGTAGACTC 


TGAATTTTTA 


ATTGCTCAAT 


11800 


TTGAGGCTGA 


AGAGCTGACA 


ATCTGTTGAC 


TTCATCCTTA 


CAAATTTTTA 


11850 


ACTGGCTTTT 


AATTGCTGTT 


GGCTCTGATA 


GGGTGGTAGA 


CTGGGTTTTC 


11900 


AACAAGTTTT 


CGGCAGTAGT 


TGTCATCT6T 


TCCAATTGTT 


GTAGCTGATT 


11950 


ATAAAAGGTA 


ATGATGTTGG 


TTTGATACTC 


TAGCCAGTTA 


ACTCTCTCAC 


12000 


TCAGCAATT6 


GCAGAATTCT 


GTCCACC6GC 


TGTTCAGTT6 


TTCTGAAGCT 


12050 


TGTCTGATAC 


TTTCAGCATT 


AACACCCTCA 


TTTGCCATCT 


GTTCCACCAG 


12100 


GGCCTGAGCT 


GATCTGCTG6 


CATCTTGCAG 


TTTTCTGAAC 


TTCTCTGCTT 


12150 


TTTCTCGTGC 


TATGGCATTG 


ACTTTTTCTT 


GCAAGTCTGA 


GATGTTGCCT 


12200 


TCTTTTCGAT 


AGACTGCAAA 


TTCAGAACTC 


TGTAATACAG 


CTTCTGAAC6 


12250 


AGTAATCCAA 


CTGTGAAGTT 


CAGTTATATC 


GACATCCAAC 


CTTTTCCTGA 


12300 


GTTCAGAATC 


CACAGTTATC 


TGCCTCTTCT 


TTTGAGGAGG 


TGGTGGTGGA 


12350 


AGTTCCTCTT 


GGGCATGTTT 


TACCATGATT 


TGTTCCCTTG 


TGGTCACCAT 


12400 


AGTTACCGTT 


TCCATTACAG 


TTGTCTGTGT 


TAGGGATGGT 


TGAGTGGTG6 


12450 


TGACAGCCTG 


TGAAATTT6T 


GCTGAACTCT 


TTTCAAGTTT 


TTGGGTTAAA 


12500 


TTGTCCCAAC 


GTTGTGCAAA 


GTTTTCCATC 


CAGATTTCCA 


TCTTTTGAGT 


12550 


CACTGACTTA 


TTTTTCAGTG 


CCGAAAGTAG 


ATCTTGATTG 


AGTGAACTTA 


12600 


GTTTTTCCAT 


GGTTGGCTTT 


TTCTTTTCTA 


GATCTATTTT 


TAAAGTAGAT 


12650 
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FIGURE 12K 

ATTTTGTGJVA GACTTGACAT CATTTCATTT TGATCTTTAA AGCCACTTGT 
CTGAATGTTC TTCATTGCAT CTTCTTTTTC TGAAAGCCAT GTACTAAAAA 
GGCACTGTTC TTCAGTAAAA TGCTGCCATT TTAGAAGAAT ATCTTGTAAA 
ACAATCCAGC GGTCTTCAGT CCATCTGCAG ATATTTGCCC ATCGATCTCC 
CAGTACCTTA AGTTGTTCTT CCAAAGCAGC TGTTGCATGA TCACCGCTGG 
ATTCATCAAC CACTACTACC ATGTGAGTGA GCGA6TT6AC CCT6ACCTGC 
TCCTGTTCTA GATCTTCTTe AAGCACCTTA TGTTGTTGTA CTT6GCATTT 
TAGATCTTCA AGATCAGGTC CAAAGGGCTC TTCCTCCATT TTCTTAGTTC 
TCTCTTCAGT TTTTGTTAAC CAGTCATCTA GTTCTTTTAA TTTCTGATTC 
TGGAGATCCA TTAGAACTTT GTGTAATTTG CTTTGTTTTT CCATGCTAGC 
TACCCTGAGA CATTCCCATC TTGAATTTAG GAGATTCATT TGTTCTTGCA 
CTTCAGCTTC TTCATCTTCT GATAATTTCC CTTTTCCAAC TAGTTGACTT 
CCTAACTGTA GAACATTACC AACAAGTCCT TGATGAGATG TCAGATCCAT 
CATGAATCCC TCATGAGCAT GAAACTGTTC TTTCACTTCT TCAACATCAT 
TTGAAATCTC TCCTTGTCCT CGCAATGTAT CCTCGGCAGA AAGAAGCCAT 
GAAAGTACTT CTTCTAAAGC AGTTTGGTAA CTATCCAGAT TTACTTCCGT 
CTCCATCAAT GAACTGTCAA GTGACTTGTC TCTGGGAGCT TCCAAATGCT 
GTGAAGGATA GGGGCTCTGT GTGGAATCAG AGGTGGCAAC ATAAGCAGCC 
TGTGTGAAGG CATAACTCTT GAATCGAGGC TTAGGAGATG AAGAAGTTT6 
TTCATAGCCC TGTGCTAGAC TGACTGTGAT CTGTTGAGAG TAATGCATCT 
GGTGATGTAA TTGAAAATGT TCTTCTCTA6 TtACTTTTGA AGATGTCCTG 
GGCAACATTT CCACTTCTT6 AATGGCTTCA ATGCTCACTT GTTGTGGCAA 
AACTTGAAAG AGTGATGTGA TGTACATTAA GATGGACTTC TTGTCTGGAT 
AAGTGGTAGC AACATCTTCA GGATCAAGAA GTTTTTCTAT GCCTAACTGG 
CATTTTGCAA TGTT6AAGGC ATGTTCCAGT CTTTGGGTGG CTGAGTGCTG 
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TGAAACCACA 

GAGCATTCAA 

TTAACCTGTG 

CTTTTCACTG 

TTTTGACCTG 

TTATGATTTC 

ATTATTTTTC 

GAACTCTTGT 

TCCAAGAGGT 

GTCTATGTGT 

TGAATGTTTT 

TCCCACCAAA 

ATCTTGGTAA 

GATGAGAAGC 

TAGCACTTCA 

AGAGCGGAAT 

GGGTACAATT 

TAGAAGTAAA 

CACCGGATCC 

TAACTTTCT6 

AGGCTGGATC 

GCGTCTCCAG 

CCTCCCACCG 

TTACGACATT 

ATTGACGTCA 

ATCCACGCCC 



CTATTCCAAT 

AGCCAACCCG 

GATAATTACG 

TTGGTTTGCT 

CCAGTGGAGG 

CATCCACTAT 

TGTAAGACCC 

AGATCCCTTT 

CTAGGAGGCG 

TGCTTTCCAA 

CTTTTGAAGA 

GCATTTG6AA 

AAGTTTCTCC 

CAATAAACTT 

AGTCTTCCTA 

TCCTGCAGCC 

CCGCAGCTTT 

GGCAACATCC 

GGGACCTGAA 

GTTTTTCAGT 

GGTCCCGGT6 

GCGATCTGAC 

TACACGCCTA 

TTGGAAAGTC 

ATGGGGTGGA 

ATTGATGTAC 
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FIGURE 12L 
CAAACAGGTC GGGCCTGTGA 
TCGGACCAGC TAGAGGTGAA 
TGTTGACTGT CGAACCCAGC 
GCAATCCAGC CTTGATAGTT 
ATTATATTCC AAATCAAACC 
GTCAGTGCTT CCTATATTCA 
GCAGTGCCTT GTTGACATTG 
TCTTTTGGCA GTTTTTGCCC 
TTTTCCATCC TGCAGGTCAC 
ACTTAGAAAA TTGTGCATTT 
TCTTCTCTTT CATAACAGTC 
GAAAAAGTAT ATATCAAGGC 
CAGTTTTATT GCTCCAGGAG 
CAGCAGCCTT GACAAAAAAA 
TTCGTTTTTT CTATAAAGCT 
CGGGGGATCC ACTAGTTCTA 
TAGAGCAGAA GTAACACTTC 
ACTGAGGAGC AGTTCTTTGA 
ATAAAAGACA AAAAGACTAA 
TCCTCGAGTA CCGGATCCTC 
TCTTCTATGG AGGTCAAAAC 
GGTTCACTAA ACGAGCTCTG 
CCGCCCATTT GCGTCAATGG 
CCGTTGATTT TGGTGCCAAA 
GACTTGGAAA TCCCCGTGAG 
TGCCAAAACC GCATCACCAT 



CTATGGATAA 
GTTGATGAC6 
TCAGAAGAAT 
TTCATCACAT 
AAGAGTGA6T 
CTAAATCAAC 
TTCAGGGCAT 
TGTAAGGCCT 
TGAAGAGGTT 
ATCCATTTTG 
CTCTACTTCT 
AGGGATAAAA 
GCTTAGGTAC 
AAAAAAAAAA 
ATT6CCTTCA 
GAGCGGC06C 
CGTACAGGCC 
TTTGCACCAC 
ACTTACCAGT 
TAGAGTCCG6 
AGCGTGGAT6 
CTTATATAGA 
GGCGGAGTT6 
ACAAACTCCC 
TCAAACCGCT 
GGTAATAGCG 
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FIGURE 12H 

ATCACTAATA CGTAGATGTA CTGCCAA6TA GGAAAGTCCC ATAAGGTCAT 
GTACTGGGCA TAATGCCAGG CGG6CCATTT ACCGTCATTG ACGTCAATAG 
GGGGCGTACT TGGCATATGA TACACTTGAT GTACTGCCAA 6TGGGCAGTT 
TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGGCGT 
TACTATGG6A ACATACGTCA TTATTGACGT CAATGGGCGG GGGTCGTTGG 
GCGGTCAGCC AGGCGGGCCA TTTACCGTAA GTTATGTAAC GACCTGCAGG 
TCGACTCTAG AGGATCTCCC TAGACAAATA TTACGCGCTA TGAGTAACAC 
AAAATTATTC AGATTTCACT TCCTCTTATT CAGTTTTCCC CCGAAAATGG 
CCAAATCTTA CTCGGTTACG CCCAAATTTA CTACAACATC CGCCTAAAAC 
CGCGCGAAAA TTGTCACTTC CTGTGTACAC CGGCGCACAC CAAAAACGTC 
ACTTTTGCCA CATCCGTCGC TTACATGT6T TCCGCCACAC TTGCAACATC 
ACACTTCC6C CACACTACTA CGTCACCCGC CCCGTTCCCA CGCCCCGCGC 
CACGTCACAA ACTCCACCCC CTCATTATCA TATTGGCTTC AATCCAAAAT 
AAGGTATATT ATTGATGATG CTAGCGGGGC CCTATATATG GATCCAATTG 
CAATGATCAT CATGACAGAT CTGCGCGCGA TCGATATCAG CGCTTTAAAT 
TTGCGCATGC TAGCTATAGT TCTAGAGGTA CCGGTTGTTA ACGTTAGCC6 
GCTACGTATA CTCCGGAATA TTAATAGGCC TAGGATGCAT ATGGCGGCC6 
GCCGCCTGCA GCTGGCGCCA TCGATACGCG TACGTCGCGA CCGCGGACAT 
GTACAGAGCT CGAGAAGTAC TAGTGGCCAC GT6GGCCGTG CACCTTAAGC 
TTGGCACTGG CCGTCGTTTT ACAACGTCGT GACTGGGAAA ACCCTGGCGT 
TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA 
ATAGCGAAGA GGCCCGCACC GATCGCCCTT CCCAACAGTT GCGCAGCCTG 
AATGGCGAAT GGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCG6 
TATTTCACAC CGCATACGTC AAAGCAACCA TAGTACGCGC CCTGTAGCGG 
CGCATTAAGC GCGGCXK^GTG TGGTGGTTAC GCGCAGCGTG ACCGCTACAC 
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TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG CTTTCTTCCC TTCCTTTCTC 


16500 


6CCACGTTCG 


CCGGCTTTCC 


CCGTCAA6CT 


CTAAATC6G6 


GGCTCCCTTT 


16550 


AGGGTTCCGA 


TTTAGTGCTT 


TACGGCACCT 


CGACCCCAAA AAACTTGATT 


16600 


TGGGTGATG6 


TTCACGTAGT 


6GGCCATCGC 


CCTGATA6AC GGTTTTTC6C 


16650 


CCTTTGACGT 


TGGAGTCCAC 


GTTCTTTAAT AGTGGACTCT TGTTCCAAAC 


16700 


TGGAACAACA 


CTCAACCCTA 


TCTCGGGCTA 


TTCTTTTGAT 


TTATAAGG6A 


16750 


TTTT6CCGAT 


TTCGGCCTAT 


TGGTTAAAAA 


ATGAGCT6AT 


TTAACAAAAA 


16800 


TTTAAC6CGA ATTTTAACAA AATATTAACG TTTACAATTT TATGGTGCAC 


16850 


TCTCAGTACA 


ATCTGCTCT6 


ATGCCGCATA 


GTTAAGCCA6 


CCCC6ACACC 


16900 


CGCCAACACC 


CGCTGACGCG 


CCCTGACGGG 


CTTGTCTGCT 


CCCGGCATCC 


16950 


GCTTACAGAC 


AAGCTGTGAC 


CGTCTCCGGG 


AGCTGCATGT 


GTCA6AGGTT 


17000 


TTCACCGTCA TCACCGAAAC GCGCGAGAC6 AAAGGGCCTC GT6ATAC6CC 


17050 


TATTTTTATA 


GGTTAATGTC 


ATGATAATAA 


TGGTTTCTTA 


GACGTCAG6T 


17100 


G6CACTTTTC 


GGGGAAAT6T 


GCGC6GAACC 


CCTATTTGTT 


TATTTTTCTA 


17150 


AATACATTCA 


AATATGTATC 


CGCTCATGAG 


ACAATAACCC 


TGATAAAT6C 


17200 


TTCAATAATA 


TTGAAAAAG6 


AAGA6TAT6A 


GtATTCAACA 


TTTCCGT6TC 


17250 


6CCCTTATTC . 


CCTTTTTTGC 


GGCATTTTGC 


CTTCCTGTTT 


TTGCTCACCC 


17300 


AGAAACGCT6 


GTGAAAGTAA 


AAGATGCTGA 


AGATCAGTT6 


GGTGCACGAG 


17350 


TGGGTTACAT 


CGAACTGGAT 


CTCAACAGOG 


GTAAGATCCT 


TGAGA6TTTT 


17400 


CGCCCCGAAG 


AACGTTTTCC 


AATGATGAGC 


ACTTTTAAAG 


TTCTGCTATG 


17450 


TGGCGCGGTA 


TTATCCCGTA 


TTGACGCCGG 


GOAAGAGCAA 


CTCGGTCGCC 


17500 


6CATACACTA 


TTCTCAGAAT 


GACTTGGTTG 


AGTACTCACC 


AGTCACAGAA 


17550 


AA6CATCTTA 


CGGATGGCAT 


GACAGTAAGA 


GAATTATGCA 


GTGCTGCCAT 


17600 


AACCATGAGT 


GATAACACTG 


CGGCCAACTT 


ACTTCTGACA 


ACGATCG6A6 


17650 


GACCGAAGGA 


GCTAACCGCT 


TTTTTGCACA 


ACATGGGGGA 


TCATGTAACT 


17700 
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CGCCTTGATC GTTGGGAACC 
GCGTGAOVCC ACGATGCCTG 
TAACTGGCGA ACTACTTACT 
ATGGAGGCGG ATAAAGTTGC 
TGGCTGGTTT ATTGCTGATA 
GTATCATTGC AGCACTGGG6 
ATCTACaCGA CGGGGAGTCA 
CGCTGAGATA GGTGCCTCAC 
TTTACTCATA TATACTTTAG 
AGGATCTAGG TGAAGATCCT 
ACGTGAGTTT TCGTTCCACT 
GATCTTCTT6 AGATCCTTTT 
AAAAAACCAC CGCTACCAGC 
AACTCTTTTT CCGAAGGTAA 
CTGTTCTTCT AGTGTAGCCG 
GCACCGCCTA CATACCTCGC 
CAGTGGCGAT AAGTCGTGTC 
CGGATAAGGC GCA6CGGTCG 
AGCTTGGAGC GAACGACCTA 
ATGAGAAAGC GCCACGCTTC 
TAAGCGGCAG GGTCGGAACA 
AACGCCTGGT ATCTTTATA6 
GCGTCGATTT TTGTGATGCT 
CCAGCAACGC GGCCTTTTTA 
CACATGTTCT TTCCTGCGTT 
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FIGURE 120 

GGAGCTGAAT GAAGCCATAC 
TAGCAATGGC AACAACGTTG 
CTAGCTTCCC GGCAACAATT 
AGGACCACTT CTGCGCTCGG 
AATCTGGAGC CGGTGAGCGT 
CCAGATGGTA AGCCCTCCCG 
GGCAACTATG GATGAACGAA 
TGATTAAGCA TTGGTAACTG 
ATTGATTTAA AACTTCATTT 
TTTTGATAAT CTCATGACCA 
GAGCGTCAGA CCCCGTAGAA 
TTTCTGCGCG TAATCTGCTG 
GGTGGTTTGT TTGCC6GATC 
CTGGCTTCAG CAGAGC6CAG 
TAGTTAGGCC ACCACTTCAA 
TCTGCTAATC CTGTTACCAG 
TTACCGGGTT GGACTCAAGA 
GGCTGAACGG GGGGTTCGTG 
CACCGAACTG AGATACCTAC 
CCGAAGGGAG AAAGGCGGAC 
GGAGAGCGCA CGAGGGAGCT 
TCCTGTCGGG TTTCGCCACC 
CGTCAGGGGG GCGGAGCCTA 
CGGTTCCTGG CCTTTTGCTG 
ATCCCCTGAT TCTGTGGATA 
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CAAACGAC6A 


17750 


CGCAAACTAT 


17800 


AATAGACT66 


17850 


CCCTTCCGGC 


17900 


GGGTCTCGCG 


17950 


TATCGTAGTT 


18000 


ATAGACAGAT 


18050 


TCAGACCAAG 


18100 


TTAATTTAAA 


18150 


AAATCCCTTA 


18200 


AAGATCAAAG 


18250 


CTTGCAAACA 


18300 


AAGAGCTACC 


18350 


ATACCAAATA 


18400 


GAACTCTGTA 


18450 


TGGCTGCTGC 


18500 


CGATAGTTAC 


18550 


CACACA6CCC 


looOU 


AGCGTGA6CT 


18650 


AGGTATCCGG 


18700 


TCCAGGGG6A 


18750 


TCTGACTT6A 


18800 


TGGAAAAACG 


18850 


GCCTTTTGCT 


18900 


ACCGTATTAC 


18950 
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FIGURE 12P 

CGCCTTTGA6 T6AGCTGATA CCGCTC6CC6 CAGCC6AAC6 ACCGAGC6CA 19000 

GCGAGTCAGT GAGCGAG6AA GCGGAA6A6C GCCCAATAC6 CAAACCGCCT 19050 

CTCCCC6CGC GTTGGCCGAT TCATTAATGC A6CTGGCAC6 ACAGGTTTCC 19100 

CGACTGGAAA GCGG6CAGTG AGCGCAACGC AATTAATGTG AGTTAGCTCA 19150 

CTCATTAGGC ACCCCAGGCT TTACACTTTA TGCTTCCGGC TCGTATGTTG 19200 

TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA GCTATGACCA 19250 

TGATTAC6AA TTCGAAT6GC CATGGGAC6T CGACCT6AGG TAATTATAAC 19300 

CCGGGCC 19307 
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